(19) 



EuropSisches Patentamt 
European Patent Office 
Office europden des brevets 



(11) 



EP 0 974 652 A1 



(12) 



EUROPEAN PATENT APPLICATION 

published in accordance with Art. 158(3) EPC 



(43) Date of publication: 

26.01.2000 Bulletin 2000/04 

(21) Application nurT4)er: 98911227.1 

(22) Date of filing: 07.04.1998 



(51) H.C\7: C12N 15/12. C12P 21/02, 
CI 2N 5/10, C07K 14/47, 
C07K 16/18, G01N 33/53, 
G01 N 33/574 

(86) International application number: 
PCT/JP98/01592 

(87) International publication number: 

WO 98/45431 (15.10.1998 Gazette 1998/41) 



(84) 


Designated Contracting States: 


• ARAKAWA, Hlioharu 




AT BE CH CY DE DK ES R FR GB GR IE IT LI LU 


Banyu Pharmaceutical Co., Ltd. 




MCNLPTSE 


Ibaraki 300-2611 (JP) 


(30) 




• OHTA, Masataka 


Priority: 08.04.1997 JP 10533397 


Banyu Pharmaceutical Co., Ltd. 


(71) 




Ibaraki 300-2611 (JP) 


Applicant: 




BANYU PHARMACEUTICAL CO., LTD. 


(74) Representative: 




Chuo-ku, Tokyo 103-8416 (JP) 


VOSSIUS & PARTNER 


(72) 




Siebertstrasse 4 


Inventors: 


81675 Munchen (DE) 




MORITA, MasashI 




Banyu Pharmaceutical Co., Ltd. 






ibaraki 300-2611 (JP) 





(54) CANCEROUS METASTASIS-ASSOCIATED GENE 

(57) A gene exhibiting stronger expression in 
mouse IMG-HM cells than in mouse IMC-LM cells and 
isolated by using the differential display method. When 
the relation between the expression of the isolated gene 
and cancerous metastasis is examined, it is found out 
that a transformant with the larger expression dose of 
the antisense mRNA to the gene shows the more low- 
ered cancerous metastasis ability. By using the isolated 
protein, cancerous metastasis inhibitors can be 
screened. 
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DescripU n 

Technical Reld 

B [0001 ] The present invention relates to a protein associated with cancer metastasis and a gene encoding the same. 
Background Art 

[0002] In spite of the remarkable development of methods of diagnosis and treatment, cancer is still one of the main 
10 causes of death in most dey^eloped countries, including Japan. In malignant cancer, cancer cells not only randomly pro- 
liferate but also invade (infiltrate) surrounding organs and metastasize into distant organs. In fact, the majority of deaths 
due to cancer are caused by its recurrence by metastasis. Therefore, it seems possible to completely cure cancer by 
the surgical excision of cancerous foci if the metastasis or invasion of cancer cells is effectively suppressed. Now that 
a method for treating localized cancer cells is about to be established, the most prefen-ed subject for improving the treat- 
15 ment of cancer is now likely to shift to how to treat metastasis. Hence, analyses of genes associated with the invasion 
and metastasis of cancer cells and proteins as gene products thereof are extremely important for developing new can- 
cer treatment methods. 

[0003] Hitherto, a number of cancer metastasis-associated genes have been reported. For example, ''nm23° was dis- 
covered by the differential hybridization method in mouse melanoma K-1 735 subtypes exhibiting different colony form- 

20 ing abilities when they are transplanted through mouse tail vein to the lung. Its metastasis potency and expression in 
Northern Wot are inversely con-elated. Transfer of cDNA of "nm23" into a highly metastatic cell done has been reported 
to result in the reduction of experimental metastasis potency (cf. Steeg, P. S. et al., J. Natl. Cancer Inst., 80, 200-204 
(1988) and Leone,A. et al.. 25-35 (1991)). Also, a gene highly expressed in low metastatic cell lines was isolated by the 
differential hybridization method performed with low and high metastatic cells derived from rat prostate cancer R-3327 

25 cell line; said gene was reported to encode f ibronectin. In addition, a 4- to 8-fbld reduction of the expression of mRNA 
of f ibronectin was reported in many cell clones which became highly metastatic by allowing said clones to express the 
v-Ha-ras gene (cf. Schalken, J. A. et al.. Cancer Res., 48, 2042-2046 (1988)). Furthermore, a number of genes includ- 
ing KAI1 (Dong, J. T. et al., Sdence, 268, 884-886 (1995)), stromelysin-3 (Basset, P. et al.. Nature. 348. 699-704 
(1990)), pGM21 (Phillips, S. M. et al., J. Natl. Cancer Inst., 82, 199-203 (1990)), etc. have been reported as cancer 

30 metastasis-associated genes. 

[0004] However, most of these reports were limited only to studies with the reduction of migration and invasion abilities 
of cancer cells, in which the expression of a target gene had been suppressed, as indicators. No report has appeared 
on the suppression effect actually demonstrated in the assay system using experimental animals. 

35 Disclosure of the Invention 

[0005] An objective of the present invention is to provide a novel protein associated witii cancer metastasis and DNA 
encoding said protein. Another objective of this invention is to provide a vector carrying said DNA, a translbrmant har- 
boring said vector, and a method for preparing a recombinant protein that comprises culturing said transformant Still 
40 another objective of this invention is to provide DNA used for detecting, isolating, and amplifying said gene, or for sup- 
pressing the expression Uiereof. A further objective of this invention is to provide a method of screening a cancer metas- 
tasis inhibitor using proteins and genes associated with cancer metastasia 

[0006] Mouse IMC-HM cells, mutant cells isolated from mouse IMC cell line by Arakawa et al., are highly metastatic 
to the liver. When IMC-HM cells are subcutaneously tiBnsplanted to the ventral of mice, first they spontaneously metas- 

45 tasize to the liver, then rapidly metastasize to tiie main organs of the whole body, leading to the death of the host in 
about two weeks. In contrast, almost no metastasis potency is observed with IMC-LM cells, a subculture of the parental 
IMC cells. Mice transplanted witii said cells sun/ive for at least six weeks after the b'ansplantation, and no metastatic 
foci are observed at all with ttie naked eye (cf. Arakawa, H. et al., Jap. J. Cancer Res., 87, 518-523 (1996)). Alttiough it 
may be readily assumed that tiiese two cell lines share a very dose genetic background, they widely differ in properties 

so such as the presence or absence of spontaneous metastasis potency to distant organs. In view of these facts, the 
present inventors suspected Uie presence of a key gene controlling cancer metastasis in mouse IMC-HM cells, and 
attempted to isolate a strongly expressed gene from mouse IMC-HM cells as conpared with mouse IMC-LM cells using 
the differential display method. As a result, the present inventors succeeded in isolating two mouse cDNA fragments, 
which are candidates for the lador Indudng cancer metastasis. Furtiiermore, they screened a cDNA library derived 

S5 from mouse IMC-HA1 Cells using these cDNA fragments as probes, and succeeded in isolating full-length cDNA 
thereof 

[0007] The present inventors also investigated the relation between the expression of isolated mouse cDNAs and can- 
cer metastasis. Specifically, they constructed a vector expressing antisense-mRNA against one of ttie Isolated mouse 
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cDNAs (mouse CMAP), and expressed said vector in IMC-HA1 cells derived from IMC-HM cells to prepare many trans- 
r formants which exhibited different expression levels of the antisense mRNA. The inventors then transplanted the trans- 

'' formants into mice to investigate their cancer-metastasis abilities. As a result, it was found that the higher the antisense 

MRNA expression level of the transformant is, the lower its cancer metastasis potency is. The inventors also succeeded 
5 in isolating human cDNA (human CMAP) that exfiibited a high homology with mouse CMAP by the polymerase chain 

reaction using primers prepared based on the nucleotide sequence of isolated mouse CMAP. 

[0008] Since a close relation was recognized between the expression of isolated genes and cancer metastasis, the 

present inventors found it was possitile to screen cancer metastasis inhibitors using these proteins and genes. 

[0009] The present invention relates to a novel protein and gene associated with cancer metastasis, and to a method 
10 of using them to screen cancer metastasis inhibitors. More specifically, the present invention relates to: 

(I) a protein comprising any one of the amino add sequences set forth in SEQ ID NOs: 4, 6, 9, or 38, or a protein 
comprising any one of said amino add sequences having substitution, deletion, or addition of one or more amino 
acids and having cancer metastasis potency; 

IS (2) a protein encoded by DNA which hybridizes with DNA comprising any one of the nucleotide sequences set forth 
in SEQ ID NOs: 3, 5, 7, 8, or 37, said protein having cancer metastasis potency; 

(3) a DNA encoding the protein according to (1); 

(4) the DNA according to (3) comprising any one of the nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 8, 
or 37; 

20 , (5) a DNA hybridizing with a DNA comprising any one of the nudeotide sequences set forth in SEQ ID NOs: 3, 5, 
7, 8, or 37, encoding a protein having cancer metastasis potency; 

(6) a vector can-ying the DNA according to any one of (3) to (5); 

(7) a transformant hart>oring the vector according to (6); 

(8) a method for preparing the protein according to (1) or (2), comprising culturing the transformant set forth in (7); 
2S (9) a DNA specifically hybridizing with the DNA according to any one of (3) to (5), comprising at least 1 5 nudeotide 

residues; 

(10) an antisense DNA against the DNA according to any one of (3) to (5) or a portion thereof; 

(II) an antitxxiy binding to the protein according to (1) or (2); 

(12) a method for screening a compound having cancer metastasis inhibitory ability, comprising steps of: 

J 30 

(a) contacting a test sample with the protein according to (1) or (2), and 

(b) selecting compounds having the activity to bind to the protein according to (1) or (2);, 

(13) a method for screening compounds having cancer metastasis inhibitory ability, comprising steps of: 

35 

(a) contacting test samples with cells expressing the protein according to (1) or (2), 

(b) detecting the expression level of the protein according to (1) or (2) in cells contacted with test samples using 
the antibody set forth (11), and 

(c) selecting a compound which reduces the expression level of the protein according to (1 ) or (2) as compared 
40 with that in cells not contacted with test samples. 

[0010] The present invention relates to a novel protein associated with cancer metastasis. The nudeotide sequence 
of cDNA isolated and designated 'CMAP' by the present inventors is shown in SEQ ID NO: 3, and the amino acid 
sequence encoded by said cDNA is shown in SEQ ID NO: 4. Mouse "CMAP" cDNA has been isolated by the differential 

45 display method as a gene which is highly expressed in a mouse cancer metastatic IMC-HM cell line derived from mouse 
IMC ceils. Its expression was not detected in the IMC-LM cell line, which had been similarly derived from mouse IMC 
cell line but had almost no detectable cancer metastasis potency (Cf. Example 1). Studies on the expression of mouse 
CMAP in various cancer cells revealed that it was expressed in M-5076 cells, LSI 78Y cells, P388 cells, and L121 0 cells, 
but not In IMC-HA1 cells, which formed metastasis fod mainly in the liver when transplanted through the tail vain of 

so mice. In contrast, B-16-BL6 cells and Colon 26 cells, in which the expression of mouse CMAP was not detectable, 
strongly induced the experimental metastatic fod in the lung, but did not exhibit hepatic metastasis. All other cell lines 
in which mouse CMAP was expressed below the detection limit were almost completely rejected and regressed when 
they were transplanted to tiie tail vein (cf. Example 7). Many b-ansformants that expressed antisense mRNA against a 
transcript of mouse CMAP cDNA in various levels were prepared and their cancer metastasis abilities in vivo were 

55 examined by transplanting them into mice. As a result it was found out that higher expression levels of antisense mRNA 
of transformant were related with lower cancer metastasis potendes of said transformant (cf. Example 9). These facts 
indicate a close association of mouse CMAP expression witti cancer metastasis. 

[0011] In addition, the pr sent inventors isolated cDNA (human CMAP cDNA) exhibiting a high homology with mouse 
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CMAP from human spleen by polymerase chain reaction using primers prepared based on the nucleotide sequence of 
> mouse CMAP cDNA. This cDNA's nucleotide sequence is shown in SEQ ID NO: 37, arvj the amino add sequence of 
the protein encoded by said cDNA is shown in SEQ ID NO: 38. Human CMAP had a homology as high as 71 .6% with 
mouse CMAP in the antino acid sequence (Cf. Example 8). This fact strongly indicates that human CMAP functions 

5 similarly to mouse CMAP. 

[0012] Furthermore, as in the case of mouse CMAP, the irrventors isolated three cDNA clones (#7.16411 , #8.323, and 
#1 1 .2441 3) that were cverexpressed in mouse IMC-HM cells, by the differential display method using mouse IMC-HM 
cells and IMC-LM cells. Nucleotide sequences of three cDNA clones are shown in SEQ ID NOs: 5, 7. and 8. Compari- 
son of these nucleotide sequences strongly indicated that '#7.1641 1" and "#1 1.24413' are produced when the #8.323 

10 is processed differently. Clones '#7.1641 1" and '#1 1 .24413' have an identical nucleotide sequence except for about 
100 bp at the 5'-end, and were presumed to encode proteins of 131 and 126 amino acids with 20 to 30 different amino 
acids of the N-terminus. Amino add sequences of proteins encoded by '#7.16411' cDNA and '#11.24413' cDNA are 
shown in SEQ ID NOs: 6 and 9, respectively. When the expression patterns of '#7.16411" and '#11.24413' in various 
cancerous and normal cells were examined, '#7.1641 1" was detected in almost all cancerous cells and normal tissues 

15 in spite of the difference in their expression level. However, '#1 1 .24413" was detected only in cancer cells such as IMC- 
HA1 cells derived from mouse IMC-HM cells, and P388 cells, and in normal tissues such as the thymus, lung, and 
spleen and slightly in the peripheral blood (cf. Example 6). These results indicate that #8.323 to '#7.1641 1' are proc- 
essed in the majority of nomial tissues, and "#1 1 .24413" is assumed to be expressed In specific cells. "#1 1 .24413" may 
be a factor in providing IMC-HA1 cells with properties such as a rapid invasion of the whole body. The above-described 

so characteristics indicate the possibility of using these proteins for screening compounds having cancer metastasis inhib- 
itory ability. 

[001 3] The protein of this invention may be prepared as a recombinant protein as well as a natural protein using the 
genetic recon^ination technique by methods Known to those skilled in the art. The natiral protein, for example, can be 
prepared by subjecting extracts of cells or tissues that are expected to express the protein of the present invention to 

25 affinity chromatography using the antibody of the present invention desaibed below. The recombinant protein may be 
prepared culturing cells transformed with the DNA encoding the protein of this invention as described below. 
[0014] The present invention also relates to a protein functionally equivalent to the atx>ve-desaibed protein (mouse 
CMAP, human CMAP, #7.1641 1. and #1 1.24413). For example, a functionally equivalent protein can be isolated by a 
method of indudng the amino add mutation in a protein. Such a method is known to those skilled in the art. Skilled per- 
J 30 sons would be able to prepare a mutant protein functionally equivalent to the natural protein by appropriately substitut- 
ing amino acKl(s} in the above-described protein (comprising any one of amino acid sequences set forth in SEQ ID 
NOs: 4, 6. 9, and 38) using Kunkel's method (Kiinkel. T A. et al., Methods Enzymol. 1§4. 367-382 (1987)), the double 
primer method (Zoller, H. J. and Smith, H., Mettiods Enzymol. 154. 329-350 (1987)), the cassette mutation mettiod 
(Wells, et al., Gene M. 315-23 (1985)), ttie megaprimer method (Sarkar, G. and Sommer, S. S., Biotechniques S. 404- 

35 407 (1990)), etc. Amino add mutation of a protein may also occur spontaneously Such a mutant protein derived from 
a natural protein by substituting, deleting, or adding amino adds in its amino add sequence, and proteins functionally 
equivalent to the natural protein are also induded in the protein of the present invention. Here, "functionally equivalent" 
means that the protein has cancer metastasis potency. Cancer metastasis potency of the protein can be detected by, 
for example, suppressing tiie expression or function of said protein in metastatic cancer ceils (for example, using the 

40 antisense technique), transplanting said suppressed cells into an animal body, and examining its cancer metastasis 
potency (Cf. Example 9). The extent of amino add mutation in the functionally equivalent protein compared wrtti the nat- 
ural type protein is usually 10% or less of the total amino acids, preferat)ly 10 amino adds or less, more preferably 3 
amino adds or less, still preferably 1 amino add. 

[001 5] Anottier method known in the art for isolating the functionally equivalent protein is the hybridization technique 
45 (Hanahan, D. and Meselson, H., Method. Enzymol. 100. 333-342 (1983) and Benton, W. D. and Davis, R. W., Sdence 
196. 1 80-1 82 (1 977)). Those skilled in the art can easily obtain a protein functionally equivalent to the above-described 
protein from DNA that is isolated based on tiie DNA sequences (SEQ ID NOs: 3, 5, 7, 8, and 37) encoding the above- 
desaibed protein or a portion thereof and highly homologous with said DNA sequences. Such a protein encoded by 
DNA hybridizing with the DNAs encoding the akxjve-described proteins and functionally equivalent to said proteins is 
50 also included in the protein of the present invention. Here, "functionally equivalent" means tfiat the protein has cancer 
metastasis potency similarly as described aix>ve. Hybridization for isolating functionally equivalent proteins is usually 
performed in 6 x BSC and 40% formamide at 25°C and subsequent washing with 1 x SSC at 55°C. Preferably, hybrid- 
ization is performed in 6 x SSC and 40% fonnamide at 37°C and washing with 0.2 x SSC at 55°C. More preferably, 
hybridization is perfrx-med in 6 x SSC and 50% formamide at 37°C and washing with 0.1 . x SSC at 62°C. TTiose skilled 
55 in the art can otiviously realize such stringent hybridization conditions by appropriately selecting the dilution rate of 
SSC, concenbBtion of formamide, temperature, etc. Animals from which functionally equivalent proteins are isolated 
include human, mouse, rat, rabbit, sheep, cow, dog, and pig, but are not limited thereto. A DNA encoding the protein 
thus isolated usually has high homology with a nudeotide sequence of cDNA encoding the atx>ve-described protein 



4 



EP 0 974 652 A1 



(nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 8, and 37). High homology means a sequence identity of at 
j least 70% or more, preferably 80% or more, and more preferably 95% or more, at the nucleotide level. 

[001 6] In addition, the present invention relates to DNA encoding the at>ove-described protein of the present invention 
(mouse CMAR human CMAP, #7.18411, #1 1.24413, and a protein functionally equivalent thereto). There are no iimita- 

5 tions In the type of the DNA of this invention as long as it can encode the protein of the Invention. The DNA of this inven- 
tion thus includes synthetic DNA as well as cDNA and genoinic DNA. The DNA of this invention can be isolated by 
methods known in the art. For example, cDNA can be saeened by labeling cDNA encoding the protein of ttiis inventon 
(for example, cDNA comprising any one of nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 8, and 37) or a frag- 
ment thereof, RNA complementary to said cDNA, or a synthetic oligonucleotide comprising a portion of the nucleotide 

10 sequence of said cDNA with ^^P, etc., and then hybridizing the labeled product with a cDNA library derived from cells 
or tissues expressing the protein of this invention. Alternatively, said cDNA can be cloned by synthesizing oligonucle- 
otides con'esponding to the nucleotide sequence of said cDNA, and amplifying them using a polymerase chain reaction 
with cDNA derived from appropriate cells or tissues as a tenplate. Genomic DNA can be screened by hybridizing a 
genomic DNA library with a probe such as cDNA encoding the protein of the present invention (for example, cDNA com- 

15 prising the nucleotide sequence of SEQ ID NO: 3, 5, 7, 8, or 37), a portion tiiereof, complementary RNA, or a synthetic 
oligonucleotide containing a part of the sequence of said DNA, which are labeled with ^^P and the like. Alternatively, 
genomic DNA can be cloned by amplification using polymerase chain reaction with synthetic oligonucleotide primers 
having a sequence con-esponding to the nucleotide sequence of ttie cDNA. Synttietic DNA can be prepared by, for 
example, chemically synthesizing oligonucleotides comprising a partial sequence of cDNA encoding the protein of the 

20 present invention (for example, cDNA comprising the nucleotide sequence set forth in SEQ ID NOs: 3, 5, 7, 8, or 37), 
annealing ttiem to form a double sti'and, and ligating them using a DNA llgase (Khorana, H. G. et al., J. Biol. Chem. 251. 
565-570 (1976); and Goeddel, D. V. et al., Proc. Nati. Acad. Sd. USAZS. 106-10 (1979)). 

[0017] TTie DNA thus prepared Is useful for producing a recombinant protein. The protein of the present invention can 
be prepared as a recombinant protein by inserting DNA encoding the above-described protein of this invention (for 

25 example, cDNA comprising the nucleotide sequence set fbrtin In SEQ ID NOs: 3, 5, 7, 8, or 37) into an appropriate 
expression vector, introducing said vector Into appropriate cells, culturing b'ansfbrmants thus obtained, and purifying a 
protein thus expressed. Specifically, when Escherichia coli is a host a plasmid vector such as pET-3 (Rosenberg, A. H. 
et al., Gene 56, 125-35 (1987)), pGEX-1 (Smith, D. B. and Johnson. K. S.. Gene SL 31-40 1988)), etc. can be used. 
Escherichia coli can be transformed by Hanahan's method (Hanahan, D., J. Mol. Biol. 1M, 557-580 (1 983)), ttie elec- 
) 30 troporationmethod(Dower.W.J.etal.,Nucl.AcidsRes.1S.6127-6145(1988)),etc.Arecombinantproteincanbesyn- 
thesized in the form of a fused protein in which it binds to the tag of histidine residues, glutathione S-fa^nsferase (GST), 
etc. at the N-terminus, etc., and can be purified by binding said fused protein to metal-chelate resin, GST-affinity resin 
(Smith, M. C. et al., J. Biol. Chem. 2^, 721 1-7215 (1988)), etc. The desired protein can be developed from the fused 
protein by cleaving the fused protein with thrombin tjlood coagulating factor Xa, etc. When a fission yeast, Schizosac- 

35 charomyces pombe. is a host, pESP-1 (Lu, Q. et al.. Gene 2QQ. 135-144 (1977)) or tiie like plasmid vector is used. For 
example, yeast is b^nsfbrmed by tiie spheroplast method (Beach. D. and Nurse, R, Natijre 2§Q, 140 (1981)), lithium 
acetate method (Okazaki, K. et al., Nucl. Acids Res. 18, 6485-6489 (1990)), etc. When pESP-1 is used, a recon*inant 
protein is synthesized as a fusion protein with glutathione S-b-ansferase (GST) and is purified by binding the fusion pro- 
tein to GST-aff inlty resin. The protein of interest can be separated from the fused protein by cleaving it witti ttirombin, 

40 blood coagulating factor Xa, etc. When mammalian cells such as CHO cells derived from Chinese hamster ovary, 
human HeLa cells, etc. are the host, pMSG (CLONTECH) or tiie like vector is used. The recombinant DNA can be inti-o- 
duced into mammalian cells by tiie calcium phosphate method (Graham. R L. and van derEb, A. J., Virology 456- 
467 (1973)), DEAE-dextran mettiod (Sussman, D. J. and Milman. G.. Mol. Cell. Bid. 4. 1641-1643 (1984)), lipofection 
method (Feigner, R L. et al., Proc. Nati. Acad. Sci. USA 84, 7413-7417 (1987)). electi-oporation method (Neumann, E. 

45 et al., EMBO J. 1 841 -845 (1 982)), etc. When insect cells are the host, tiie baculovirus vector pBacPAK8/9 (Clontech) 
or similar vectors are used. For exanrple, insect cells can be bsnsfbrmed according to methods as described in 
Bio/Technology, S, 47-55 (1980). 

[001 8] The present invention also relates to DNA specifically hybridizing with ttie above-described DNA of ttie present 
invention and having at least 15 nucleotide residues. Here, "specifically hybridize" means tiiat DNA does not cross- 
so hybridize with other DNAs encoding other proteins under sti-ingent conditions. Such DNA can be utilized as a probe for 
detecting and isolating DNAs encoding the protein of this invention, and as a primer for amplifying said DNAs. 
[0019] The present invention also relates to antisense DNA against ttie DNA encoding ttie protein of the present 
invention or a portion thereof. Such antisense DNA is utilized to suppress the expression of the protein of the present 
invention. For example, it is possible to suppress ttie cancer metastasis atiility of said cells by expressing such anti- 
55 sense DNA in cells with cancer metastasis potency to suppress the expression of ttie protein of ttiis invention. In order 
to display ttie antisense effect, the antisense DNA is at least 15 bp or more, preferably 100 bp or more, and usually 
shorter ttian 800 bp, preferably shorter tiian 600 bp. 
i [0020] Furthermore, this invention relates to an antibody binding to the protein of the present Invention. Antibodies 
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binding to the protein of the present invention can be prepared by methods known in the art (cf. 'Shin Seikagaioj 
Jlkkenkoza 1, Proteins I, 389-406, Tokyo Kagaku Dojin"). Polyclonal antibodies can be prepared as follows. A suitable 
amount of the above-described protein or peptide is administered to animals to be immunized such as rabbits, guinea 
pigs, mice, or chickens. The antigen may b administered together with an adjuvant (such as PIS and FCA) to stimulate 

5 the antibody production. Immunization is usually performed every several weeks. Repeated immunization allows ele- 
vating the antibody titer. After the final immunization, antiserum can be obtained by collecting the blood from immunized 
animals. Polyclonal antibodies can be prepared by fractionating the antiserum using the ammonium sulfate precipita- 
tion, anion exchange chromatography and affinity chromatography with protein A or the immobilized antigen. Mono- 
clonal antibodies can be prepared as follows. The protein of this invention or a partial peptide thereof is administered to 

10 animals to be immunized as described above. After the final immunization, the spleen or lymph node is excised from 
immunized animals. Antibody-producing cells contained in the spleen or lymph node are fused with myeloma cells 
using polyethylene glycol to prepare hybridomas. The hybridomas of interest are then screened and cultured. Mono- 
clonal antibodies can be prepared from the culture supernatant and purified by fractionation using the ammonium sul- 
faie precipitation, anion exchange chromatography and affinity chromatography with protein A or the immobilized 

IS antigen. Antibodies thus prepared can be used not only for affinity purification of the protein of this invention but also for 
diagnosis and antibody treatment of diseases (e.g. cancer) caused by the abnormal expression of the protein of this 
invention, and detection of the expression level of the protein. For antibody treatment, the antibody to be used is pref- 
erably a humanized antibody or human antitxxJy. Humanized antibody, for example, a mouse-human chimeric antibody, 
can be prepared by isolating the antibody gene from mouse cells producing the antibody against the protein of this 

20 invention, recombining the gene of the constant region of H chain with that of the constant region of human IgE H diain, 
and introducing the resulting recombinant gene into mouse myeloma J558L cells (Neuberger, N. S. et al., ^4ature 314. 
268-270 (1985)}. Human antibodies can be prepared by immunizing mice, whose immune system has been replaced 
with that of humans, with the protein of this invention. 

[0021 ] The present invention also relates to a metiiod for screening a compound with cancer metastasis suppressing 
S5 potency using the protein of this invention. One embodiment of the screening method according to this invention com- 
prises steps of (a) contacting the protein of this invention with test samples, and (b) selecting a compound that binds to 
the protein of ti^ls invention. Test samples used for the screening include purified proteins, expression products of genes 
(including gene library), extracts of tissues or cells, supernatant of cell cultures, synthetic low molecular weight com- 
pounds, and metatx>lltes of microorganisms, but are not limited to them. Test samples used for the screening may be 
30 appropriately labeled witti a radioactive substance, a fluorescent substance, etc., but are not limited to them. Binding of 
test samples with the protein of this invention can be detected by the label on a compound bound to the protein of this 
invention (for example, detecting the birxling by the radioactivity or fluorescence intensity). When the TWO-hybrid sys- 
tem is used [Zervos et al.. Cell 72, 223-232 (1994) and Fritz et al.. Nature 376, 530-533 (1995)), tiie binding can be 
detected tiy examining the activity of the reporter gene. Besides the above-described TWO-hyt)rid system, various 
35 methods known to ttiose skilled in the art can be applied to screening compounds binding to the protein of tills inven- 
tion. The methods include the affinity purification method using a column to which the protein of this invention Is Immo- 
bilized and various peptide display methods such as the phage display metiiod (R Parmly and G. P. Smith, Gene, 73. 
305(1988)). 

[0022] Another embodiment of the screening method of this invention comprises steps of (a) contacting test samples 

40 witii cells expressing the protein of ttiis invention, (b) detecting ttie expression level of the protein of tills invention in tiie 
cells contacted witti said test sanples using an antibody binding to the protein of this Invention, and (c) selecting a com- 
pound that reduces the expression level of the protein of this invention as compared with ttiat in cells not contacted with 
said test samples. In this screening metiiod, there are no limitations in ttie type of cells to be treated with test samples, 
as long as they express the protein of tills invention. Cells to be treated may thus include IMC-HM cells or cells derived 

46 from them such as M-5076 ceils and L^178Y cells. Test samples may include purified proteins, expression products of 
genes (Including libraries), extracts of tissues or cells, supernatants of cell cultures, syrrthetic compounds of low molec- 
ular weights, and metabolites of microorganisms but are not limited to these. Many known methods, including the 
ELISA method, immune precipitation metiiod, and Western knotting metiiod, are available without limitation for detecting 
the expression level of the protein oi this invention using an antibody The antibodies used are labeled for detecting the 

so protein of this invention if required. Exanples of labels include enzymes, radioactive substances, and fluorescent sub- 
stances, but are not so limited. H is also possible to detect tiie protein of tills invention wittiout labeling ttie antibody of 
the present Invention but by labeling the molecule specifically binding to tiie antibody of this invention such as a sec- 
ondary antibody or protein A. A desired connpound can also be screened by detecting the expression of mRNA as well 
as ttiat of ttie protein described above. The expression of mRNA can be detected by, for example, reverse tianscription 

55 PCR (RT-PCR ). Q3PDH can be used as a control. Compounds isolated by ttie above screening methods may be can- 
didates for suppressing the cU^tivity of the protein of this invention. These compounds can be utilized as cancer metas- 
tasis inhibitors. 

[0023] Effects of the compounds isolated by the above-described screening mettiods are preferably confirmed in vivo 
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as occasion demands. One embodiment of the in vivo detection metliod comprises steps of (a) contacting a compound 
isolated by the above-described saeening method with cells with cancer metastasis potency, (b) transplanting said 
cells to a nonhuman test mammal, and (c) detecting the cancer metastasis in said nonhuman test mammaJ to deter- 
mine whether said compound suppresses cancer metastasis When mice are used as the test animal, mouse IMC-HM 
5 cells or those derived from them are preferably used as the cells with cancer metastasis ability. In this case, cells can 
be transplanted to mice subcutaneously or through the tail vein. Cancer metastasis can be detected by examining the 
formation of metastasis foci, for example, in the liver. 

[0024] Another embodiment of the in vivo detection of cancer metastasis suppressing ability of the compound is the 
method of directly administering the test compound to mice. This method comprises steps of (a) administering a com- 

10 pound isolated by the above-described screening method to nonhuman test mammals, (b) transplanting cells with can- 
cer metastasis potency to said nonhuman test mammals, and (c) detecting cancer metastasis in said nonhuman test 
mamnrals to determine whether said compound suppresses cancer metastasis. The conpound can be administered 
orally, intraperitoneally, through the tail vein, subcutaneously, intracutaneously. or intramuscularly As in the above- 
described method, when mice are used as the test animal, cells with cancer metastasis ability are preferably mouse 

IS IMC-HM cells or those derived therefrom. These cells can be transplanted to mice sutxutaneously or through the tail 
vein, etc. Cancer metastasis can be detected by examining the formation of metastasis foci, for example, in the liver. 

Brief Description of the Drawings 

20 [0025] 

Rgure 1 is a photograph showing electrophdretic patterns of CMAP expressed in various cancer cells detected by 
Northern Ijlotting. 

Figure 2 is a photograph showing elecbxjphoretic patterns of CMAP expressed in normal tissues detected by 
25 Northern t}lotting. 

Figure 3 compares amino add sequences between CMAP and human family-2 cystatin. 

Best Mode for Implementing the Invention 

30 [0026] The present invention will be descrbed in more detail below with reference to examples, but is not to be con- 
strued to be limited to these examples. 

Example 1 Cloning of cancer metastasis-associated gene 

35 (1) Extraction of «ie total RNAs from IMC-HM and IMC-LM cells 

[0027] The total RNAs were extracted from IMC-HM and IMC-LM cells using ISOGEN (Nippon Gene). Rrst cultured 
cells in the logarrthmic growth phase (1x10^ cells) were collected and sedimented by centrifugation. After tiie culture 
supernatant was removed by aspiration, ISOGEN (1 ml) was added to ttie cells, and tiie mixture was allowed to stand 

40 at room temperature for 5 min to lyse the cells. Chloroform (0.2 ml) was ttien added to the mixture, and the resulting 
mixture was vigorously vortexed for 15 sec, allowed to stand at room temperature for several minutes, and centrifuged 
(4°C) at 12,000 X g for 15 min. The aqueous layer containing RNA was collected, and an equal volume of isopropanol 
was added thereto. The mixture was stirred, allowed to stand at room temperature for 10 min, and centrifuged again 
under cooling at 12,000 x g for 15 min. RNA thus sedimented was collected, rinsed with ethanol, and dissolved in ster- 

45 ilized pure water (50 fil), which had been ti-eated with diethyl pyrocartx>nate (DEPC) to obtain a total RNA solution. 
When cultured cells (5x10^ cells) from which the total RNA was isolated were ti-ansplanted subcutaneously into the 
venti-ai of a five-week old female CDF1 mouse, observation with Vne naked eye revealed that IMC-HM cells metasta- 
sized to the liver of the host mouse to kill it in two weeks, while IMC-LM cells did not metastasize to the liver, allowing it 
to live for six weeks. 

50 

(2) Preparation of single-stranded cDNA using reverse transcriptase 

[0028] A complementary single-stranded DNA was prepared witti the total RNA obtained above as a template using 
reverse transcriptase Superscript II (GIBCO BRL) derived from MMLV and an mRNA Fingerprinting Kit (Clontech). A 
55 cDNA synthesis primer (oligo cTT primer) (1 pi) was added to the total RNA (2 ng), and the total mixture volume was 
made 5 (J by adding DEPC-treated, sterilized pure water. The mixture was kept at 70°C tor 3 min then quickly cooled. 
A mixed solution (total volume 5 jd) *e reaction buffer (5 x first sb-and buffer) (2 nl), dNTP mix (each 5 mM) (2 fj), 
and Supersaipt II (200 unit/pl) (1 (J) was added to tills mixture, and the resulting mixture was allowed to stand at 42<>C 
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for 1 hour in an air incubator. After the reaction was completed, the mixture was treated at 72°C for 10 min, diluted 160- 
fbld with the sterilized pure water, and stored at -20 "C . The RNA fingerprinting was then performed based on polymer- 
ase chain reaction (PGR) using this single-stranded cDNA solution. 

5 (3) Amplification of single-stranded cDNA by PGR 

[0029] Equal anraunts of LA Tag DNA polymerase (Takara Shuzo) (5 U/ ^1) and undiluted solution of TaqStart Anti- 
body (registered trade mark, Glorrtech) were mixed, and the mixture was allowed to stand at room temperature for a few 
rr^nutes to make a 50 x polymerase mix. This mix (0.4 (il) was combined with 10 x LA PGR buffer (Takara Shuzo) (2 ^1), 

10 sterilized pure water (13.9 jil). dNTP mix (Gtontech) (5 mM each) (0.2 nl), and p^SJdATPaS (Amersham) (0.5 pi), and 
tile resulting mixture was used as a PGR master mix. To the stock solution of the single-stranded cDNA (1 pS) were 
added 20 jiM of P primer (PI -10/SEQ ID NOs: 10 to 19), 20 nM of T primer (T1 -9/SEQ ID NOs: 20 to 28) (Qontech) (1 
^l each), and the above-described PGR master mix (1 7 jil). The resulting mixture was overlaid with mineral oil (Sigma), 
and PGR was performed in a DNA thermal cycler PJ2000 (PERKIN ELMER GETUS) to amplify several genes at once. 

15 The PGR comprises one cyde of 94°C for 5 min, 40°C for 5 min, and 68°G for 5 min; two cycles of 94''G for 2 min, WC 
for 5 min, and 68°G for 5 min; 25 cycles of 94°G for 1 min, 60°G for 1 min, and 68°G for 2 min; and finally 68°G for 7 min. 

(4) Separation of PGR products by electrophoresis 

so [0030] A plate for the sequence gel (20 x 40 cm) was assembled using a spacer according to the standard method. 
A 6% glycerol-resistant gel and glyceroi-resistant buffer (United States Biochemical) were used. PGR products (3.5 pi 
each) amplified with P primer and T primer were mixed with a reaction termination solution (2 \ii each), heated at 94<'C 
for 2 to 3 min, and then cooled. The resulting mixture (3 1 each) was loaded onto the glycerol-resistant gel and elec- 
trophoresed at 35 W for about 3 hours. After electrophoresis, the gel was transferred onto a filter paper, which was dried 

2S at eO'C for 2 hours in a gel dryer. Radioactivity in the gel was copied on an imaging plate for BAS2000 (Fuji Film) by 
leaving it on said plate overnight, and Imaged with BAS2000. 

[0031] This imaging analysis was used to search for genes whose expression level significantly differred between 
IMG-HM cells and IMG-LM cells. If the expression level of one gene is predominantiy high in IMC-HM cells, said gene 
can be a candidate of a gene that Induces cancer metastasis. Furthermore, If the expression level of one gene is pre- 
j 30 dominantiy high in IMC-LM cells, said gene would be a candidate of a gene that suppresses cancer metastasis. Two 
bands of possible candidates for the factor inducing metastasis were found from IMC-HM cells. One band was detected 
in PGR products otitained from a primer set of P2 and T3, and the other from a primer set of P2 and T8. The former 
band was designated '23-1 ," and the latter band, "28-1 .' 

35 (5) Gloning of IMG-HM cell-specific cDNA bands 

[0032] IMG-HM cell-specific cDNA bands 23-1 and 28-1 were seised directly from the dried gel, and each gel seg- 
ment was trailed in sterilized pure water (40 (d) for 10 min to elute the desired cDNA fragments. This minute quantity of 
cDNA fragment was re-anplif led as follows. 

40 [0)33] 1 0 x EX Tag buffer (Takara Shuzo) (5 jJ), dNTP mix (each 2.5 mM) (1 (iQ. EX Taq DNA polymerase (Takara 
Shuzo) (5 U/fJ) (0.5 nO. and sterilized pure water (31.5 nO were mixed to make a PGR master mix. The eluted cDNA 
fragment (7 pi) and the P primer and T primer (2.5 pi each) which had been used for detecting said cDNA band were 
added to this mix. The mixture was overiaid witii mineral oil and subjected to PGR comprising 20 cycles of 94''G for 1 
min, 60°G for 1 min, and BSfC for 2 min In a DNA thermal cycler PJ2000 (PERKIN ELMER GETUS) to specHically 

4S amplify the desired cDNA fragment PGR products were electrophoresed on 1% agarose gel using a Mupid (COSMO- 
BIO) at 50 V for 1 hour and jderttified by staining wifli ethidium bromide. 

[0034] In general, PGR products of cDNA have additional A at their 3' -terminus, and may be directiy cloned by a TA 
type vector (having T at its terminus). Therefore, cDNA fragments thus re-amplified were cloned using a TA cloning kit 
(Invitrogen) according to the following method. A 10 x ligation buffer (Invitrogen) (1 ^0■ PCR II vector (registered ti-ade 

so mark, Invitrogen) (25 ng/ml) (2 jd), sterilized pure water (9 nl). and T4 DNA ligase (Invitrogen) (1 jd) were added to re- 
amplified PGR fragments (5 nO. and the mixture was allowed to stand at 14'»G overnight to insert the cDNA. A tube of 
One Shot INVaP competent cells (registered trade mark, Invitrogen) was mixed with 0.5 M p-mercaptoethanol (50 ^0- 
The above-described vector previously ligated with cDNA (2 pi) was added to this mixture, and the resulting mixture 
was incubated at 42°G for 30 sec. After standing on ice for 2 min, an SOG medium (450 pi) was added, and the mixture 

55 was shake-cultured at 37»C for 1 hour. The solution otrtained after the above transformation procedure (1 00 pi and 200 
mO was spread onto a previously prepared LB agar plate containing 50 pgAnl kanamycin and X-<3al and allowed to 
stand at 37"G overnight. The vector wittiout the insertion reacts with X-Gal to turn blue while the one with cDNA incor- 
porated turns white. According to this principle, 10 white colonies were picked up and incubated again to be used for 
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sequencing; at the same time, replicas were prepared. Plasmids were prepared from each transformant cell strain using 
a QIAprep Spin Plasmid Kit (QIAGEN) by the following procedure. 

[0035] Each transformant cell strain was incubated in an LB medium (3 ml) containing 50 (ig/ml kanamycin at 37°C 
for about 8 hours and centrlfuged to collect the ceils. These cells were dissolved in buffer P1 (QIAGEN) (250 pi), mixed 
5 with buffer P2 (QIAGEN) (250 mI). further mixed with buffer P3 (QIAGEN) (350 (J), and then centrifuged at 10,000 x g 
for 10 min. The supernatant was transferred onto a QIAprep Spin column (QIAGEN) and centrifuged at 10,000 x g for 
1 min. The column was washed with buffer PB (QIAGEN) to rennove the nuclease activity and further washed with the 
buffer PE (QIAGEN) (0.75 rrS). The plasmid retained in the column was then eluted with the TE buffer and collected. 
The concentration of the plasmid was calculated from the A260 value determined with a Hitachi spectrophotometer (ti- 
ro 3300). 

(6) Sequencing of the cloned IMC-HM cell-specific cDNA fragment 

[0036] A nucleotide sequence of cDNA was determined by the dye terminator method (equipment used, ABI Model 
>5 377). Although performance conditions may vary depending on the purification grade and concentration of plasmid 
sending as a template, in general the sequence consisting of 500 to 600 nucleotides may be determined at the same 
time using the dye terminator method. pCR II vector has the fbnvard and reverse primer sites of Ml 3 near the insertion 
site. Using each primer previously fluorescence-labeled, both strands were compared to determine the respective 
sequences. AmpliTaq (25 jd), 5 x buffer (25 pi), and purified water (125 pi) were mixed to dilute the aizyme, and premix 
20 was prepared for each of A, C. G, and T. To prepare premix of A or C, equal volumes of d/ddNTP, dye-primer, 5 x buffer, 
and the diluted enzyme (25 pi each) were mixed; the premix of G or T was prepared by mixing 50 pi each of these ingre- 
dients. The template DNA (100 to 200 ng/pl) (1 pi) and the premixes of A and C (4 pi each), and said template ONA (2 
pi) and the premixes of G and T (8 pi each) were separately mixed and »jbjected to cycle sequencing reaction with 25 
cycles of 96°C for 2 min, 96''C for 1 0 sec, 50°C for 5 sec. and 60»C for 4 min using a PERKIN ELMER 9600. Four clones 
25 randomly selected from 10 each cloned plasmids were sequenced, and all of them were found to have identical nucle- 
otide sequences. One of the plasmid clones obtained from 23-1 was then designated 23-1 #2, and that obtained from 
28-1 was designated 28-1 #3. 

[0037] Next, sequences determined for 23-1#2 (SEQ ID NO: 1) and 28-1#3 (SEQ ID NO: 2) were analyzed with a 
computer using software from Genetics Computer Group, Wisconsin Package version 8.1 . GenBank and EMBL data- 
30 base searches for the sequence homology with a FiastA revealed that these sequences were entirely novel and had 
never been reported. 

Example 2 Establishing donal lines reflecting properties of IMC-HM cells and IMC-LM cells 

35 [0038] Clonal lines of IMC-HM cells and IMC-LM cells were established by the limiting dilution culture method. Cells 
of each strain were distributed on a 96-well plate to 4 cells/10 wells, and 10 clones that had each proliferated from a 
single cell were randomly selected. 

[0039] In order to detect the expression of 23-1 #2 and 28-1#3 In each of these clones, RT-PCR was performed as 
follows. The total RNA and the reverse transaiptase were prepared sim'larly to the above-described methods ((1) and 
40 (2) in Example 1), In order to amplify the specific cDNA by PCR, single-stranded oligo DNA primers, "23-1#2Fr 
(CAGAATCTGCTCATGCAGTC (SEQ ID NO: 29)), "23-1#2R1" (CACTCCTTACTTTCCACCCC (SEQ ID NO: 30)), "28- 
1#3F1" (AACAGCATTTCCCTAAAGCTCQG (SEQ ID NO: 31)), and "28-1#3Rr (TGGAAACTACTTCCCTGCTCCCA 
(SEQ ID NO: 32)) were designed. 

[0040] Portions of the sequences of 23-1#2 and 28-1#3 can be specifically amplified using a set of 23-1#2F1 and 23- 
45 1#2R1 and another set of 28-1 #3F1 and 28-1 #3R1. Thirty cycles of PCR were performed using single-stranded cDNAs 
prepared from cells derived from IMC-HM and IMC-LM cells as a template, and EX Tag DNA polymerase under condi- 
tions similar to tiiose in Example 1 (5). As a result, it was observed that botti 23-1#2 and 28-1#3 were amplified as 
cDNAs of a similar appropriate size in the IMC-HM-derived clonal cells, while no signals were detected in the IMC-LM- 
derived clonal cells. 

50 [0041] Next, in order to confirm the metastasis potency of each of the clonal cells in vivo, 5x10'* cells were sut)cu- 
taneously transplanted to CDF1 mice. All clonal cells derived from IMC-HM killed the host two to three weeks after the 
transplantation, and their remarkable metastasis was observed with the naked eye in the liver and spleen while no 
metastasis foci were observed at all with tiie cells derived from IMC-LM. In addition, no distinct difference was detected 
between the cells derived from IMC-LM and those derived from IMC-HM, even by the obsenmtion of morphology, pre- 
ss liferation ability in vitro, etc. One clonal cell strain each was selected from the IMC-HM- and IMC-LM-derived clonal cells 
and designated "IMC-HAI" and 'IMC-LE5,' respectively. The IMC-HA1 cell was deposited with the following depositary 
institution: 
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(a) Name of the depositary institution and its address: 

Name: National Institute of Bioscience and l-luman-Technology Agency of Industrial Science and Technology 
Address: 1-3, Higashi 1-chomeTsukut}a-shi Ibaraki-ken 305-0046 Japan 

5 

(b) Date of deposition (Date of original deposition): April 1, 1997 

(c) Accession Number: PERM BP-6242 

Example 3 Analysis of clones 23-1 #2 and 28-1 #3 in IMC-HA1 and IMC-LE5 cells by Northem blot 

10 

[0042] In order to examine whether the expression of clones 23-1 #2 and 28-1 #3 differs at the poly(A) RNA level, 
Northern blot analysis was performed. Poly(A) RNA was prepared from IMC-HA1 and IMC-LE5 cells using a Messen- 
ger RNA Isolation Kit (STRATAGENE) as follows. 

[(K)43] About 10° cells were reco\^ered from a culture suspension (100 ml) and dissolved in a denaturing solution 
is (STRATAGENE) (5 ml) containing p -mercaptoethanol (50 fi I)- An elution buffer (STRATAGENE) (1 0 ml) was added to 
this mixture, and the resulting mixture was cenb-ifuged at 12,000 x g for 10 min. The supernatant was mixed with oligo- 
dT cellulose (0.04 g/ml) (5 mO and gently stirred at room temperature for 15 min. After the mixture was centrifuged at 
700 x g for 3 min, the supernatant was removed, and the cellulose was resuspended in a high-salt buffer (STRATA- 
GENE) (5 ml). This washing process was repeated twice, and then the cellulose residue was suspended in a low-salt 
20 buffer (STRATAGENE) (5 ml). The cellulose suspension was then packed into a 2.5-ml push column (STRATAGENE), 
and the remaining poly(A) RNA was eluted with an elution buffer (400 pi) at 68°C. 3 M sodium acetate (40 ^l) and 100% 
ettianol (1 .1 ml) were added to the eluate, and tiie mixture was centrifuged at 15,000 x g. Precipitates thus obtained 
were rinsed with 80% ethanol and dissolved in a TE buffer containing no RNase (40 ^) to make a poly(A) RNA solution. 
Ck>ncentration was determined by measuring the absorbence (A260). 
25 [0044] Next, for Northern blot analysis, 23-1 #2 and 28-1 #3 were directly radio-labelled with [a-^P]dCTP using a Meg- 
aprime DNA labelling system (Amersham) according to the standard method. The primer (5 fil) was added to cDNA 
(100 ng) to make the total volume 33 \iS, and the mixture was heated at 95°C for 5 min and allowed to stand at room 
temperature. A labelling txjffer (Amersham) (1 0 jJ), [a-^^PjdCTP (5 (J). a«Td Wenow DNA polymerase I (1 unit/ ^0 (2 fil) 
were added to this mixture , and the mixture was incubated at 37 °C for 1 0 min. To terminate the reaction, 0.5 M EDTA 
J 30 (3.5 )il) was added to this mixture , and the labelled probe and unreacted [a-^^P]dCTP were separated using a Sepha- 
dex G-50 spin column. Eluates from the column were diluted with purified water, and radioactivity of the probe was 
counted with a liquid scintillation counter 

[0045] Nortiiern blot analysis was ttien performed. Poly(A) RNA (2 p g) was loaded on a 1% agarose gel prepared in 
1 x MOPS buffer and electrophoresed at 50 V tor 1 .5 hour. After the electrophoresis, the gel was left standing overnight 
35 to transfer the nucleic add from the gel onto a Hybond N+ membrane (Amersham) via 10 x SSC buffer. After the ti'ans- 
fer, the membrane was air-dried, and the nucleic acid on the membrane was fixed by UV with a STRATALINKER. After 
the UV treatment, hybridization was carried out as follows. 

[0046] The membrane was soaked in a prehybridization solution (containing 5 x Denhardt's solution, 50% formamide, 
2% SOS, and denatured salmon sperm DNA (100 (ig/ml)), and prehybridization was performed at 42°C for 1 hour [a- 
40 ^^P]dCTP-lat>elled 23-1#2 and 28-1#3 were tinen added to the above-described prehybridization solution as the probe, 
and said membrane was allowed to stand at 42°C overnight. After the hybridization, the membrane was washed once 
with 2 X SSC in 0.1% SDS solution at SO'C for 30 min and twice with 0.5 x SSC in 0.1% SDS solution at SO'C for 30 
min. The radioactivity was imaged according to the above-described method. 23-1^ and 28-1 #3 exhibited bands of 
about 1 .1 kb and 0.7 kb, respectively, which hybridized only witii the bands from IMC-HA1 Cells. 

45 

Example 4 Analysis of clones 23-1#2 and 28--\«3 in IMC-HA1 and IMC-LE5 cells by Soutiiem blot 

[0047] Genomic DNA was prepared from IMC-HA1 and IMC-LE5 cells using a DNA Extraction Kit (STRATAGENE) 
and digested with EcoRI. The digested products were loaded (10 ^g each) onto a 1% agarose gel prepared in a TAE 
so buffer, separated by electrophoresis at 50 V, transferred to Hybond N-t-, and subjected to Southern blot analysis. Both 
cDNA probes exhibited identical band patterns. Therefore, it was indicated that both 23-1#2 and 28-1#3 were present 
in the genome of both IMC-HA1 and IMC-LE5 cells but expressed more predominanfly in IMC-HA1 cells. 

Example 5 Cloning of full-length cDNAs of cbnes 23-1#2 and 28-1#3 

55 

(1) Preparation of cDNA library derived from IMC-HA1 cells 

[0048] A cDNA library derived from IMC-HA1 cells was prepared using a ZAP Express cDNA Synthesis Kit (STRAT- 
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AGENE) by the folkmring method. 

(a) Preparation of the total RNA and poly(A) RNA 

[0049] The total RNA was extracted from IMC-HAI cells by the, method described in Example 1 (1). In this case, 
extraction with ISCX3EN was repeated twice to reduce the contamination of proteins, lipids, DNA fragments, etc. From 
this total RNA, poly(A) RNA was extracted as follows. A spin column was prepared by placing a Whatmann 3M fitter 
paper on the bottom of a 2.5-ml syringe and filling it with 5 ml of oligo(dT)-cellulose (0.04 g/ml) (STRATAGENE). The 
column was loaded with a sample, allowed to stand at room temperature for 30 min, then washed three times with a 
high-salt buffer and a low-satt buffer (Oontech). After elution with an elution buffer (Clontech), a 1/5 volume of a sample 
buffer was added to the eluate,the resuHing mixture was loadeded onto a new column, and the same procedure was 
repeated. After precipitation with ethanol and rinse, the product was dissolved in a TE buffer to make a poly(A) RNA 
solution. 

(b) Synthesis of the first strand cDNA 

[0050] Poly(A) RNA (5 ng) and Xhol linker primer (1 70 pmol) were mixed, treated at 70°C for 5 min, then cooled in 
ice. 5 X first strand buffer (10 ^1), 0.1 M dithiothreitol (GIBCO BRV) (5 mI), dNTP mix (10 mM each) (3 and Rnasein 
(40 U/pS) (STRATAGENE) (1 ^ were added to this mixture , and reverse transcriptase Superscript II (GIBCO BRL) 
(12,000 U) was added to make a final volume of 50 fJ. The resulting mixture was incubated at 44°C for 1 hour. 

(c) Synthesis of the second strand cDNA 

[0051] A 10 X second strand txjffer (20 second strand dNTP mix (STRATAGENE) (6 lil), sterilized pure water 
(1 13.9 ^l), and [o -^^pjdATP (Amersham) (2 nl) were added to the first strand cDNA synthesized above (45 pi), and E. 
coli RNase H (3 U) and £ co// DNA polymerase (STRATAGENE) (100 U) were added. The mixture was Incubated at 
16°C for 2.5 hours to synthesize the second strand. 

(d) Insertion of cDNA into the ZAP Express Vector 

[0052] The free terminus of the Xhol linker primer was tilunted with Pfu DNA polymerase (STRATAGENE) (5 U), 
ligated to the EcoRI adaptor with T4 DNA ligase (4 U) by allowing the mixture to stand at 8°C overnight, and further 
phosphorylated at the EcoRI temtinus with T4 polynucleotide kinase (10 U). The Xhol linker terminus was cleaved by 
Xhol (120 U). cDNA was inserted into the ZAP Express Vector between the EcoRI and Xhol sites by reacting with the 
T4 DNA ligase at 1 2°C overnight 

(e) In vitro packaging and titer assay 

[0053] cDNA was packaged into phage using a Gigapack Gold II (STRATAGENE) exactiy according to the process 
recommended by the supplier's protocol. The titer of the phage solution after packaging was assayed as follows. The 
original phage solution and its 10-fbld dilution with an SM buffer were prepared separately, and 1 ^1 each of both solu- 
tions was incutated with 200 \il of XL1 -Blue M RF (ODeoo = 0.5) at 37»C for 1 5 min to infect the host bacteria. Incubated 
mixtures were overlaid on a 100 mm NZY agar (GIBCO) plate together with 3 ml of an LB top agar, 0.5 M IPTG, and 
1 2.5 mg of X-Gal, and allowed to stand overnight. Ninety-five percent or more of plaques were found to retain the insert, 
and the volume of phage solution required for obtaining 50,000 plagues was determined from the number of plaques 
formed. 

(f) Amplification of cDNA library 

[0054] Appropriate amounts of phage-infected host bacteria were overlaid on 20 previously prepared 150 mm NZY 
agar plates together with the top agar so that 50,000 plaques/plate could be obtained. TTiese plates were then incu- 
bated at 37°C overnight to form plaques. After 8 ml of SM buffer was added, the plates were allowed to stand at 4°C 
overnight. The SM txiffer in which phage was sufficiently released was recovered, and the plates were further rinsed 
with 2 ml of the same buffer to recover the phage. This was distributed in aliquots as the cDNA library stock solution 
and stored frozen at -70°C. This stock solution was used for screening the cDNA library . 
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(2) Screening of cDNA library derived from IMC-HA1 cells 
(a) Primary screening 

5 [0055] Twenty NZY agar plates were prepared in sterilized square #2 petri dishes. XL1 -Blue M RF' adjusted to ODgoo 
= 0.5 was infected with the cDNA library phage so as to obtain 50,000 plaques/tilate, overlaid on the above NZY agar 
plates together with an LB top agar, and allowed to stand at 37°C overnight. The next day, after suitable plaque fomia- 
tion was confirmed, the plates were cooled at 4°C for 2 hours, and DNA in plaques was transferred to a Hytx)nd N-t- 
memtxane. The membrane was denatured, dried, UV cross-linked, then subjected to hybridization by the following pro- 

10 cedure. 

[0056] The membrane was soaked in a solution containing a 2 x PIPES buffer, 50% formamide, 0.5% SDS, and dena- 
tured salmon sperm DNA (100 \i g/wi) and prehybridized at 42 'C for 1 hour. [ o - ^^PjdCTP-labeled 23-1 #2 and 28-1 #3 
were prepared according to the above-desaibed method and added as probes to the prehybrldization solution. The 
mixture was allowed to stand at 42°C overnight. After hybridization, the membrane was washed three to four times with 
15 a solution containing 0.1 x SSC and 0.1% SDS at BS'C for 20 min. The radioactivity was imaged according to the 
atwve-described method. Images were printed out in the actual size of the plate using a BAS 2000. Points correspond- 
ing to positive plaques on the plate were punched out with a sterilized and taunted tip of a PI 000, and the phage was 
eluted into an SM buffer. 

20 (b) Secondary screening 

[0057] Phage solutions that had been judged positive in the primary screening were further screened by a procedure 
similar to that described above. In this case, a serial 10-fbld dilution of the phage solution was prepared and overlaid on 
100 mm plastic dishes in place of sterilized square #2 petri dishes. The phages localized at the point judged positive in 
25 the secondary screening were eluted into the SM buffer, and the DNA insert was amplified using a set of T7 and T3 
primers. At this stage, when many inserts were detected, a similar screening process was continued. 
[0058] This saeening of the cDNA library yielded a phage clone #9.31 33 that specifically binds to 23-1 #2, and clones 
#7.1641 . #8.32, and #1 1 .2441 that specifically bind to 28-1#3 were obtained. 

30 (c) Cloning into pBK-CMV phagemid vector 

[0059] It is possible to directly excise out the ZAP Express vector in vivo as the pBK-CMV phagemid vector containing 
inserts in the presence of a helper phage (ExAssist helper phage). The pBK-CMV phagemid vector was excised out 
from the atxjve-described phage clones, introduced into XLOLR cells, and allowed to form colonies on kanamycin-con- 
35 taining LB plates for cloning. Phagemid vector clones #9.31334, #7.1 641 1 , #8.323, and #1 1 .2441 3 were thus obtained. 

(d) Sequencing of dones #9.31334, #7.1641 1 , #8.323, and #1 1.24413 

[0060] Partial sequencing of the clones was performed by the dye primer method using the pBK-CMV phagemid vec- 

40 tor containing these cDNAs. Specifically; sequencing was performed similarly as described in Example 1 (6), exc^t 
that the cycle sequencing reaction was earned out using T3 primer (AATTAACCCTCACTAAAGGG/SEQ ID NO: 33) or 
T7 primer (GTAATACGACTCACTATAGGGC/SEQ ID NO: 34). Furthermore, a primer to be used in RT-PCR and an 
appropriate primer were designed based on the sequence of the insert in the pBK-CMV phagemid vector whose cDNA 
sequence was partially determined by the dye primer method. The undetermined region was then sequenced by the 

46 dye terminator method using a PRISM Ready Reaction Terminator Cycle Sequendng Kit Double-strarxJed DNA (250 
to 500 ng), primer (3.2 pmol), and the atx>ve-described premix (8 \si) were mixed in a final volume of 20 ^, and the cyde 
sequencing was performed by the reaction of 96°C for 2 min, and 25 cycles of 96''C for 1 0 sec, 50°C for 5 sec, and 60°C 
for 4 min. Electrophoresis and analysis were performed under conditions similar to tiiose in tiie dye primer method. 
[0061] As a result clone #9.31334 having the whole length of 983 bp (SEQ ID NO: 3) was obtained for 23-1#2. This 

so sequence was found to be a novel gene comprising the region encoding 167 amino adds. This amino add sequence 
exhibited about 38% homology to the bovine cystatin B, an inhibitory protein for cathepsins. Furthermore, three cDNA 
clones (#7.16411, #8.323, and #11.24413) were obtained for 28-1 #3. Comparative studies on these nucleotide 
sequences revealed sti'ong indications that #8.323 can be processed differentiy to produce #7.16411 and #1 1.24413. 
Clone #8.323 was 1,114 bp while dones #7.16411 and #11.24413 were 691 bp and 684 bp, respectively Clones 

55 #7.1641 1 and #1 1 .24413 represented an identical nudeotide sequence except for about 100 bp on the 5'-terminus side 
and were considered to encode the proteins of 131 and 126 amino acids with a difference of 20 to 30 amin adds at 
the N-terminus. From the deduced amino add sequences, #7.1641 1 was considered to have a hydrophobic N-termi- 
nus, while #11.24413 was considered to be hydrophilic. A homology search indicated that three kinds of sequences 



12 



EP 0 974 652 A1 



obtained using 28-1 #3 also represented novel genes similar to #9.31334. Furthermore, a protein presumed to be 
encoded by #7. 1 641 1 exhibited nearly 60% homology to human SLPI (secretoary leutoprotease inhibitor). Nucleotide 
sequences of these cDNA dones are shown in SEQ ID NO: 5 (nucleotide sequence of the clone #7.1641 1), SEQ ID 
NO: 7 (nucleotide sequence of the clone #8.323). and SEQ ID NO: 8 (nucleotide sequence of the clone #1 1 .24413). 

Example 6 Expression of #7.1641 1 and #1 1.24413 in various cancer cells and normal tissues 

[0062] Primers which specifically amplify #7.1641 1 and #1 1.24413 were designed, and the expression of these 
clones in varraus cancer cells and normal tissues was examined by FTT-PCR. PGR was performed under the conditions 
in which each cDNA can be quantitatively detected in IMC-HA1 cells (conditions described below in Example 7). As set 
of primers, "7F2- (SEQ ID NO: 35) and "28-1#3F1" (SEQ ID NO: 31) were used to anplify #7.16411, and "11F2" (SEQ 
ID NO: 49) and "28-1 #3F1 " (SEQ ID NO: 31), to amplify #1 1 .24413. As a result, it was found that #7.1641 1 was able to 
be amplified in almost all cancer cells and normal tissues examined, though the gradation of developed color differed 
somewhat. In contrast, #1 1.24413 could be detected only in cancer cells such as the P388 and IMC-HA1 cells, and in 
normal tissues such as the thymus, lung, and spleen, and slightly in peripheral blood. From these results, it is highly 
possible that the processing to produce #7.16411 is common in the majority of normal tissues, while #11.24413 is 
expressed only in particular cells. #1 1 .24413 was considered to be an important factor to impart to the IMC-HAl cells 
characteristics such as the ability to rapidly invade the whole body. 

Example 7 Expression of mouse CMAP in various cancer cell lines and normal tissues 

[0063] The amino acid sequence deduced from the coding region of #9.31334 is designated mouse CMAP and is so 
called below. 

[0064] Expression of mouse CMAP was wamined In 12 different mouse cancer cell lines by reverse transcription 
PCR (Fn"-PCR)(Rgure 1 ). Total RNA was extracted from each cell line by a method similar to that in Example 1 (1), and 
a cortplementary single-stranded cDNA was prepared similarly to that in Example 1 (2). Mouse CMAP was then 
detected by PCR similarly as in Example 1 (5). The reaction was comprised of 94°C for 2 min; 30 cycles of 94°C for 1 
min and 68°C for 2 min; and 68°C for 5 min, using a set of primers "23-1#2Fr (SEQ ID NO: 29) and "23-1#2R1 " (SEQ 
ID NO: 30). Preliminary examination of the dilution of a single-stranded cDNA serving as a template confirmed that sub- 
quantitative conditions producing a nearly linear relationship between the dilution and product yield were established 
for IMC-HAl cells when CMAP was diluted 10-fold for the amplification and the control G3PDH was diluted 40-fold for 
the detection. 

[0065] Mouse CMAP was thus clearly expressed in M-5076 cells, LSI 78Y cells, P388 cells, and LI 21 0 cells in addi- 
tion to IMC-HAl cells. Anatomical observation with the naked eye and pathological analysis confirmed that all of these 
cell lines formed metastatic fod mainly in the liver when 1x10* cells were transplanted through the tail vein of mice. B- 
16-BL6 cells and Colon 26 cells in which almost no expression of mouse CMAP could be confirmed strongly induced 
an eoqaerimental metastatic foci in the lung, but not in the liver. Furthermore, all other cell lines wherein the expression 
of mouse CMAP was below the detection limit were almost rejected and regressed after the transplantation through the 
tail vein of mice. These results indicated that the expression of mouse CMAP could be dosely associated with the 
metastasis of cancer cells to the liver. 

[0066] In addition, 15 tissues were surgically excised from two to three 5-week-old normal female CDF1 mice and 
disrupted using a Polytron homogenizer. The expression of mouse CMAP was confirmed by RT-PCR similarly as 
described above (Figure 2). Mouse CMAP was found to be expressed in the thymus, spleen, and mesenteric lynph 
nodes, although the expression was dearly weaker than for IMC-HAl cells. These results indicated that mouse CMAP 
could also physiologically function in normal tissues, profoundly associating with the immune mechanism. 

Example 8 Identification of human homologue of mouse CMAP 

(1) Database Search 

[0067] The present inventors searched for a human nucleotide sequence showing homology to that of mouse CMAP 
in tfie Genbank and EMBL databases with a Fast A using VMS (Version 8) software from Genetics Computer Group. 
The results revealed that two Expressed Sequence Tags (ESTs). N47763 and N56875, showed 75.7% and 72.0% 
homologies to mouse CMAP. respectively. These two ESTs hold a homologous sequence comprising 34 nucleotide res- 
idues, strongly indicating that they might be derived from human CMAP of the same origin. 



13 



EP 0 974 652 A1 



(2) Amplification of human CMAP by RT-PCR 

[0068] Since mouse CIVIAP was expressed in the spleen among nomial tissues of mice, the human spleen was cho- 
sen as the supply source of template DNA. A single-stranded cDNA was prepared from the total RNA (OriGene) (2 jig) 
5 extracted from the human spleen by a method similar to that in Example 1 (2). Based on sequences of N47763 and 
N56875. eight primers (SEQ ID NOs: H-CHAP-1 to 8) were designed. PGR was perfbrnied on the template single- 
stranded cDNA using two primers including H-CMAP-1 and H-CMAP-2 similarly as in Example 1 (5). Reaction was car- 
ried out in the sequence 94''C for 2 min; 30 cycles of 94°C for 30 sec. 60°C for 30 sec, and 72°C for 90 sec; and 72<'C 
for 5 min. The thermal cycler used was a PERKIN ELMER 9600. 

10 

(3) Sequencing 

[0069] TTie nucleotide sequence of the cDNA product of about 0.8 kb (SEQ ID NO: 37) derived from the amplified 
human CMAP was directly sequenced according to a method similar to that in Example 1 (6) using primers H-CMAP-1 
15 to 8 (SEQ ID NOs: 39 to 46). Human CMAP exhibited a 65.9% homology to mouse CMAR In addition, human CMAP 
was located in the nucleotide residues from 83 to 583, and its amino acid sequence (SEQ ID NO: 38) possessed a 
71 .6% homology to that of mouse CMAR Therefore, it was strongly indicated that these two proteins have approxi- 
mately the same function. 

20 (4) Comparison of amino add sequences of human CMAP with family 2 cystatin 

[0070] Amino add sequences of mouse and human CMAPs and human family 2 cystatin were oonpared by a multi- 
sequence comparison program, MegAlign, in the Lasergene (DNASTAR) (Figure 3). Consensus sequences of human 
family 2 cystatin (cystein residues at. positions 121. 132, 146, and 166; glycine residue at 59; QXVXQ at 103 to 107; 

25 and VFW at 1 53 to 1 55) were found to be minimally conserved in CMAPs. However, the overall homology was 30% or 
less in all CMAPs. In particular, sequences at the N-terminus were characteristic and long, and found to comprise many 
cysteine residues which may impart specificity to CMAR Theoretically, it is highly possible that N of position 84 is glyc- 
osylated, a characteristic not observed in other sequences in its vicinity. Although CMAPs hold sequences resembling 
those hitheto reported for cystatins, they were found to comprise nrany characteristic amino add residues, indicating 

30 the possibility of having properties differing from those of known substances. 

Example 9 DeCTease in cancerous metastasis potency of IMC-HA1 cells in mice due to the constitutive decrease of 
mouse CMAP 

35 [0071 ] It is generally agreed that alterations of the nature, quality or expression level of multiple overlapped genes are 
required tor metastasis. Therefore, it was thought to be difficult to induce a single metastasis-associated gene in cells 
with low metastasis potency in mouse to confirm the elevation of their metastasis ability. IMC-HA1 cells were therefore 
transformed with a vector in which a sequence complementary to the mouse CMAP mRNA was Inserted to construct 
stably transformed clonal cells, which were examined to determine the change in metastasis potency in mice. 

40 

(1) Construction of pBK-AS-CMAP vector 

[0072] The pBK-CMV expression vector was treated with restriction enzymes Nhel and Kpnl to prepare a linear vector 
whose lacZ promoter and multicloning sites were removed. The insert was prepared from the total mRNA of IMC-HA1 

45 cells by RT-PCR as the starting material. CMAP-F1 (SEQ ID NO:. 47) and CHAP-R1 (SEQ ID NO: 48) were used as a 
set of primers. The Nhel and Kpnl restriction enzyme recognition sites are introduced in these primers at their termini, 
and their amplification product corresponds to the entire noncoding and coding regions on the 5'-side of mouse CMAP. 
The amplification product of atiout 0.5 kb was digested with restriction enzymes Nhel and Kpnl, purified with a QIAquick 
PCR Purification Kit (QIAGEN). and llgated to the above-prepared linear vector using a DNA Ugation Kit (Takara 

50 Shuzo). In this way. the pBK-AS-CMAP vector that expresses mRNA acting as the antisense against mouse CMAP in 
cells was constructed. 

(2) Transfer of pBK-AS-CMAP vector into IMC-HA1 cells and selection of transformants 

55 [0073] After the pBK-AS-CMAP vector was introduced into £ coli competent cells and cultured on a large scale, plas- 
mids were prepared in large amounts according to the standard method using a QIAfilter Plasmid Midi Kit (QIAGEN). 
The plasmid (about 50 ji g) was deaved at one site with ApaLI to make it linear, purified by phenol extraction and eth- 
anol precipitation, and dissolved in EP buffer (50 nl) (containing 25 mM Hepes (pH 7.0). 137 mM NaCI. 5 mM KCI, 
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0.27mM Na2HP04, and 6mM dextrose). IMC-HA1 cells (about 1x10^ cells) at the logaiithmlc growth phase were 
recovered from the culture medium (100 ml) and resuspended in EP buffer to prepare a cell suspension of 4 x 10^ 
cells/ml. This cell suspension (450 ^1) was mixed witti the plasmid solution prepared above (50 ^ 1), and plasmids were 
transfected into cells by electropaation under the conditions of 250 mV, 500 iiF, and a single pulse using a GenePulser 
II (Bio-Rad). After the mixture was allowed to stand at room temperature for 10 min, cells were cultured under the usual 
conditions. After 24-hour culturing, transformed cells were continuously treated with neomycin (0.3 mg/ini) for 2 weeks 
to select steble transfbrmants. The cells were cloned by the limiting dilution method to obtain about 50 clonal cells. The 
cells in which the expression level of mouse CMAP was decreased were screened by RT-PCR. As a result, three lines 
of cell clones, 57C4, 53A9, and 53E9, were obtained. Similar procedures were also performed wHh pBK-CMV vector 
with no insert, and an 18B5 cell line was obtained as the vector control cell clone. When the expression of mouse CMAP 
mRNA in these cell lines was confirmed by Nortiiern blot analyses, said mRNA was expressed in the 18B5 cell line at 
almost the same level as that in the IMC-HAI cell line, but only about 49%, 32%, and 16% in 57C4, 53A9, and 53E9 
cell lines. No particular changes in the proliferation and morphology during culturing were noticed in any of these cell 
lines. 

(3) Corrfinnation of metastasis potency in vivo 

[0074] Metastasis abilities of cloned cell lines derived from IMC-HAI cells with different expression patterns of mouse 
CMAP mRNA were tested in mice by three independent experiment systems. In experiment I, each cell line (5 x 10* 
cells) was subcutaneously transplanted to mice. The liver was excised on the 15th day after the transplantation and 
subjected to observation of appearance, weight determination, and calculation of the survival rate. Experiment II was 
performed in the same matter as experiment I except that on the third day after the transplantation primary tumors at 
the transplantation site were excised togetiier witii proximal lymph nodes. In this experiment, a model of a minute met- 
astatic focus of IMC-HAI cells in the liver can be prepared. Furthermore, in experiment III, cells (1 x 10^ cells) were 
directiy transferred through the tail vein, and the liver was excised on the 13th day after transplantation. At the same 
time, a diachronic study of the survival was performed. The experimerrt revealed that the expression of mouse CMAP 
mRNA was closely associated with the formation of liver metastatic focus and survival of mice (Table 1). Note 1) in ttie 
table reixesents 'Number of mice which survived for 60 days/Number of mice tested." 



EP 0 974 652 A1 



Table 1 
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55 [0075] It is especially noteworthy that in experiment 1 1, one out of six mice In which 53A9 cells were transplanted and 
three (half) of six mice in which 53E9 cells were transplanted survived for 60 days or more without metastasis to the 
liver. These results suggested that the expression of mouse CMAP is profoundly associated with the formation of met- 
astatic focus in the liver, and that the gene acts mainly after cancer cells enter the vessel. 
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Industrial Apolicabilitv 



[0076] The present invention provides a novel protein associated with cancer metastasis and a DNA encoding said 
protein. This invention also provides a vector carrying said DNA, a transformant harboring said vector, and a method 
for preparing a recombinant protein by cutturing said transformant. In addition, this invention provides DNA to be used 
to detect, isolate, annpltfy, or suppress the expression of said gene. Furthernmre, this invention provides a method for 
screening cancer metastasis inhibitors utilizing said protein, enabling development of novel cancer metastasis inhibi- 
tors. 
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Sequence listing 

(1) Nane of Applicant: Banyu Pharaaceutical Co., Ltd. 

(2) Title of the Invention: Cancer Metastasis-associated Gene 
C3) Reference Niuber: Bl-OOIPCT 

(4) Application Nuaber: 

(5) ^Filing Date: 

(6) Country where the priority application was filed and the application nuaber 
of the applicatira: J^an, No. Hei 9-105333 

(7) Priority date: April 8, 1997 

(8) Nuaber of Sequences: 49 

S£Q ID NO: 1 
SEQUENCE LENGTH: 656 
SEQUENCE TYPE: nucleic acid 
STBANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
HYPOTHETICAL: No 
ANTISENSE: Yes 
ORIGINAL SOURCE: 

ORGANiai: Mus Musculus 

CEU LINE: IMC-HAl cell 

SEQUENCE DESCRIPTION: SEQ ID NO: 1 

TATTATGTTG AGTGATATTT TTTTTTTTAG GTGATCAGCT TTTTATTTAT GATAAATCAT 60 

CCCCATCAAT AHAGTCTGA TGGTAGCAGA CCAGACGGCT GCA6AATCTG CTCATGCAGT 120 

CGGCCATCAA CAAGCTTCCC ACACTACCAC CTTGTTGACT GTATCTGCTT TGCTCTGCAC 180 

GTGCTCCAGT AATGTCATCA AA6GACTCTC T6CGTGCCCA CATCCTGAAT CTA6GTGTGT 240 
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AGCACCAGA6 AATTCATnT AGAAGAAAGT GTGAATACAG HAATCTGCA 6AGGAGGGCA 300 

CTATATTCTC CCAGTCCAGC CCCCTGATGA CCCACTGGCA TCTTAATCTG CTCGCTGGTA 360 

AGCAGGCCCT 6AG6ATCT6A GATG6CCAGG ATGGGGTGGA AA6TAAGGA6 TGGGGAAAAC 420 

ACGAGTGGAT ATGTATCACA GCTGCAGTCT T6CTAAAGAG ACAGAAGTCA CTGGCA6AGG 480 

AGAACAGGCA CCTCAAAACT GTGGAGCCAG GGGATGACCC AGACTTCAGA GTAGCAATAT 540 

10 AGAGTCCGCT TCAAGGCAGG GnGGniGG AAGTCACAGT TGTCCAGTT6 GTGATGCATG 600 

6TCTTCCTGC ATGTAGTTCG GCCCAnnC ACCTCCAGCA ITTAGTGAGG 6TTAAT 656 

SEQ ID NO: Z 
SEQUENCE LENGTH: 369 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
20 TOPOLOGY: linear 

• MOLECULE TYPE: cDNA 
HYPOTHETICAL: No . 
ANTl SENSE: Yes 
25 ORIGINAL SOURCE: 

ORGANISI: Mus Musculus 
CELL LINE: IMC-HAl cell 

^ SEQUENCE DESCRIPTION: SEQ ID NO: 2 

CATTATGCTG AGTGATATCT TTTTTTnGC ATAGAGAAAT GAATGCGTH ATTTATTTGC 60 

TCTCCAACAG CAmCCCTA AAGCTCGGTG AAT6CTGAGC CAAAAGGAGA TGHAGTCGT 120 

35 AAAACAGTAA CCCTGGTCCA CAGGATCCAA GCCTGGGATA CAGGGAGCCG G6AGCCATGG 180 

AACCCGGAGC AG6GACGCCT GGGAGCAGGG AAGTAGTCTC CAGAGCACAC CGAGCAC6AG 240 

TCCAGAGCCG GC6CCAATGT CAGGGATCAG GCTCACATCG GGGGCAGGCA GACmCCCA 300 

CATATACCCT CACAGCACH GTATTTGCCG TCACACTGCC CGTCCCTCCA GCATTTAGTG 360 

AGGGTTAAT 369 

SEQ ID NO: 3 
^ SEQUENCE LENGTH: 983 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY : linear 
50 MOLECULE TYPE: cDNA to iRNA 

HYPOTHETICAL: No " 
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ANTISENSE: No 

0EI6INAL SOURCE: 
^ ORGANISM: Mus Husculus 

CELL LINE: IMC-HAl cell 

IMMEDIATE SOURCE 
,^ LIBRARY : IMC-HAl cell-derived cDNA library 

CLONE :23-l#2 

FEATURE: 

NAME/KEY : CDS 
IS LOCATION : 17.. 520 

IDENTIFICATION METHOD: E 

SEQUENCE DESCRIPTION: SEQ ID NO: 3 
^ CTGAAGCTAC CCCACC ATG CCC TGG TCC TGG AGC TGT ACT TGC CGA GCA CTT 52 

Met Pro Trp Ser Trp Ser Cys Thr Cys Arg Ala Leu 
1 5 10 

2s GGC CCA CTA CAT GCT CCC TGC An TCC CCA GCC ATG TGG CTG GCC AH 100 

Gly Pro Leu His Ala Pro Cys lie Ser Pro Ala Met Trp Leu Ala He 

15 20 25 

CTG CTT GCC CTC TGC TGC CTA ACT TCT GAC ACC CAT GGG GCA CGC CCC 148 
30 Leu Leu Ala Leu Cys Cys Leu Thr Ser Asp Thr His Gly Ala Arg Pro 

30 35 40 

CCA GAT Tn TGT TCC AAA GAT TTG ATC TCC ACT GT6 AAG CCA 6GA TtC 196 
Pro Asp Phe Cys Ser Lys Asp Leu He Ser Ser Val Lys Pro Gly Phe 

35 

45 50 55 60 

CCC AAA ACA ATA GAG ACC AAT AAC CCA 6GA GT6 CTt AAG GCC GCC AGG 244 
Pro Lys Thr lie Glu Thr Asn Asn Pro Gly Val Leu Lys Ala Ala Arg 
40 65 70 75 

CAC AGT GTG GAA AAG TTC AAC AAC TGC ACA AAT GAC ATC fTT TTG TTC 292 
His Ser Val Glu Lys Phe Asn Asn Cys Thr Asn Asp I le Phe Leu Phe 
80 85 90 

^ AAG GAG TCC CAT GTC AGC AAA GCC CTG GTA CAG GTG GTG AAA GGC CTG 340 

Lys Glu Ser His Val Ser Lys Ala Leu Val Gin Val Val Lys Gly Leu 
95 100 105 

50 AAA TAT ATG CTG GAG GTG AAA ATC GGC CGA ACT ACA TGC AGG AAG ACC 388 

Lys Tyr Met Leu Glu Val Lys He Gly Arg Thr Thr Cys Arg Lys Thr- 
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no 115 120 

ATG CAT CAC CAA CTG GAC AAC TGT GAC TTC CAA ACC AAC CCT GCC TTG 436 
Met His His 6In Leu Asp Asn Cys Asp Phe Gin Thr Asn Pro Ala Leu 
125 130 135 140 

AAG CG6 ACT CTA TAT TGC TAC TCT GAA GTC T6G GTC ATC CCC TGG CTC 484 
Lys Arg Thr Leu Tyr Cys Tyr Ser GIu Val Trp Val He Pro Trp Leu 

145 150 155 

CAC AGT TTT GAG GTG CCT GTT CTC CTC TGC CAG TGACTTCTGT CTCTTTAGCA 537 
His Ser Phe Glu Val Pro Val Leu Leu Cys Gin 

160 165 
AGACTGCAGC TGTGATACAT ATCCACTCGT GTTTTCCCCA CTCCTTACTT TCCACCCCAT 597 
CCTGGCCATC TCACATCCTC AGGGCCTGCT TACCAGCCAG CAGATTAACA TGCCAGTGGG 657 
TCATCACGG6 GCTGGACTGG GAGAATATAG TGCCCTCCTC TCCA6ATTAA CTGTATTCAC 717 
ACTTTCTTCT AAAATGAATT CTCTGGTGCT ACACACCTAG ATTCAGGATG TGGGCACGCA 777 
GAGAGTCCTT TGATGACATT ACTGGAGCAC GTGCAGAGCA AAGCAGATAC AGTCAACAAG 837 
GTGGTAGTGT GGGAAGCTTC HGATGCCCG ACTGCATGAG CAGAHCTGC AGCCGTCTGG 897 
2s TATGCTACCA TCAGACTAAT AnGATGGGG ATGATTTATC ATAAATAAAA AGCTGATCAC 957 

CTAAAAGCAA AAAAAAAAAA AAAAAA 983 



10 



15 



20 



30 



35 



40 



45 



m ID NO: 4 
SEQUENCE LENGTH: 167 
SEQUENCE TYPE: anino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOUBCE: 

ORGANISM: Mus Musculus 
CELL LINE: IMC-HAl cell 



SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Met Pro Trp Ser Trp Ser Cys Thr Cys Arg Ala Leu 
1 5 10 

Gly Pro Leu His Ala Pro Cys He Ser Pro Ala Met Trp Leu Ala lie 

15 . 20 25 

Leu Leu Ala Leu Cys Cys Leu Thr Ser Asp Thr His Gly Ala Arg Pro 
50 30 35 40 

Pro Asp Phe Cys Ser Lys Asp Leu He Ser Ser Val Lys Pro Gly Phe 
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45 50 55 60 

Pro Lys Thr He Glu Thr Asn Asn Pro Gly Val Leu Lys Ala Ala Arg 

65 70 75 

His Ser Val Glu Lys Phe Asn Asn Cys Thr Asn Asp He Phe Leu Phe 

80 85 90 

Lys Glu Ser His Val Ser Lys Ala Leu Val Gin Val Val Lys Gly Leu 

95 100 105 

Lys Tyr Met Leu Glu Val Lys He Gly Arg Thr Thr Cys Arg Lys Thr 

110 115 120 

Met His His Glh Leu Asp Asn Cys Asp Phe Gin Thr Asn Pro Ala Leu 
125 130 135 140 

Lys Arg Thr leu Tyr Cys Tyr Ser Glu Val Trp Val He Pro Trp Leu 

145 150 155 

His Ser Phe Glu Val Pro Val Leu Leu Cys Gin 
160 165 



SEQ ID NO: 5 
SEQUENCE LENGTH: 691 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to iRNA 
HYPOTHETICAL: No 
ANTISENSE: No 
ORIGINAL SOURCE: 

ORGANISM: Has Musculus 

CELL LINE: IMC-HAl cell 
IMMEDIATE SOURCE: 

LIBRARY: IMC-HAl cell-derived cDNA library 

CLONE: 28-1 #3 
FEATURE: 
NAME/KIY: CDS 
LOCATION: 37.. 429 
IDENTIFICATION METHOD: E 

SEQUENCE DESCRIPTION: SEQ ID NO: 5 
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10 



20 



CTCGGGACTG GTCATCAGAG CTCCCCTGCC HCACC ATG AAG TCC TGC GGC CTT 54 

Met Lys Ser Cys Gly Leu 
1 5 

nA CCT TTC ACG GT6 CTC OT GCT CTG GGG ATC CTG QCk CCC TGG ACT 102 
Leu Pro Phe Thr Val Leu leu Ala Leu Gly lie Leu Ala Pro Trp Thr 

10 15 20 

GTG GAA GCA GGC AAA AAT GAT GCT ATC AAA ATC GGA GCC TGC CCT GCT 150 
Val Clu Gly Gly Lys Asn Asp Ala lie Lys He Gly Ala Cys Pro Ala 

25 30 35 

AAA AAG CCT CCC CAG TGC CH AAG CTT GAG AAG CCA CAA TGC CGT ACT 198 
Lys Lys Pro Ala Gin Cys Leu Lys Leu Glu Lys Pro Gin Cys Arg Thr 

40 45 50 

6AC TGG CAG TGC CC6 GGA AAG CAG AGG TGC TGC CAA GAT GCT. TGC GGT 246 
Asp Trp Glu Cys Pro Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly 
55 60 65 70 

TCC AAG TGC GTG AAT CCT GTT CCC AH CGC AAA CCA GTG TGG AGG AAG 294 
25 Ser Lys Cys Val Asn Pro Val Pro He Arg Lys Pro Val Trp Arg Lys 

75 80 85 

CCT GGG AGG TGC GTC AAA ACT CAG GCA AGA TGT ATG ATG CTT AAC CCT 342 
„Pro Gly Arg Cys Val Lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro 
90 95 100 

CCC AAT GTC TGC CAG AGG GAC GGG CAG TGT GAC GGC AAA TAC AAG TGC 390 
Pro Asn Val Cys Gin Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys 

105 110 115 

TGT GAC GGT ATA TGT GGG AAA GTC TGC CTG CCC CCG ATG TGAGCCTGAT . 439 
Cys Glu Gly He Cys Gly Lys Val Cys Leu Pro Pro Met 

120 125 130 

CCCTGACATT GCCGCCGGCT CTGGACTCGT GCTC6GTGTG CTCTG6AAAC TACTTCCCTG 499 
CTCCCA6GC6 TCCCTGCTCC G6GTTCCATG GCTCCCGGCT CCCT6TATCC CAGGCnOGA 559 
TCCTGTGGAC CAGGGTTACT GTTTTACCAC TAACATCTCC nnGGCTCA GCATOACCG 619 
AGCnTAGGG AAATGCTCn GGAGAGCAAA TAAATAAACG CArTCAmC TCTAAAAAAA 679 
AAAAAAAAAA AA 691 



30 



35 



45 



SO 



55 



SEQ ID NO: 6 
SEQUENCE LENGTH: 131 
SEQUENCE TYPE: anino acid 
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TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOUECE: 

ORGANISM: Mus Musculus 

CaL LINE: IMC-HAl cell 
SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Met L/s Ser Cys Gly Leu 
1 5 
Leu Pro Phe Thr Val Leu Leu Ala Leu Gly lie Leu Ala Pro Trp Thr 

10 15 20 

Val 61u Gly Gly Lys Asn Asp Ala He Lys He Gly Ala Cys Pro Ala 

25 30 35 

Lys Lys Pro Ala Gin Cys Leu Lys Leu Glu Lys Pro GIq Cys Arg Thr 

40 45 50 

Asp Trp Glu Cys Pro Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly 
55 60 65 70 

Ser Lys Cys Val Asn Pro Val Pro He Arg Lys Pro Val Trp Arg Lys 

75 80 85 

Pro Gly Arg Cys Val Lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro 

90 95 100 

Pro Asn Val Cys Gin Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys 

105 110 lis 

Cys Glu Gly He Cys Gly Lys Val Cys Leu Pro Pro Met 
120 125 130 

SEQ ID HO: 7 
SEQUENCE LENGTH: 1114 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
HYPOTHETICAL: No 
ANTISENSE: No 
ORIGINAL SOURCE: 

0R6ANI9(: Mus Musculus 

CELL LINE: IMC-HAl cell 



24 



EP 0 974 652 A1 



IMMEDIATE SOURCE: 

LIBRARY: IMC-HAl cell-derived cDNA library 
* CLONE: #8.323 

SEQUENCE DESCRIPTION: SEQ ID NO: 7 

,0 CCAGAGGCCA GTTTGGGATG CCAAACCCCT ACCTAACCAG AAGAAGAGAA GAAAGGCCAC 60 

TGCCGAGGTC ACTTCCAGTA CTTGGAGGAG AAAGCAAC6T TCCCATTTAC AGCTGAGTAA 120 

CAGGAGCCAC AAGGTATGTC TGACTCAAAA GTTCAGGCTC TCGATGACT6 TGCGGTGCTG 180 

CCCAGTGTGT CTTCTICAAT GTAACCTCAG GACCTAGAAC AGCACCTTGC ATGTGCTCTC 240 

AGGTGCmC TCTGATGGCC TCATGGTCCT GCCTGAAACA GAAAGTCTGC CACCTACTTC 300 

TGTAGCAGCA AGACTCCTGT TCTGTGGCTA AGCnCCTGC CTGTGCAAGA GCCACAGGGA 360 

GGGGCCAAAT GCATGCCACT GGGGCCACGC CTCCTGGTAA AGACATAAAT AGTGATCCTC 420 

20 GGGACTGGTC ATCAGAGCTC CCCTGCCnC ACCATGAAGT CCTGCGGCCT TnACCTTTC 480 

ACGGTGCTCC TTGCTCTGGG GATCCTGGCA CCCTGGACT6 TGGAAGGAG6 CAAAAATGAT 540 

GCTATCAAAA TCGGAGCCTG CCCTGCTAAA AAGCCTGCCC AGTGCCTTAA GCHGAGAAG 600 

CCACAATGCC GTACTGACTG GGAGTGCCCG GGAAAGCAGA GGTGCTGCCA AGATGCTTGC 660 

« GGTTCCAAGT CCGTGAAT€C TGTTCCCATT CGCAAACCAG TGTGGAGGAA GCCTG6GAGG 720 

TGCGTCAAAA CTCAGGCAAG ATGTATGATG CTTAACCCTC CCAATGTCTG CCAGAGGGAC 780 

GGGCAGTGTG ACGGCAAATA CAAGTGCTGT 6AGGGTATAT GTG6GAAAGT CTGCCTGCCC 840 

CCGATGTGAG CCTGATCCCT GACAnCGCG CCGGCTCTGG ACTCGTGCTC G6TGTGCTCT 900 

30 

GGAAACTACT TCCCTGCTCC CAGGCGTCCC TGCTCCGGGT TCCATGGCTC CCGGCTCCCT 960 

GTATCCCA6G CHGGATCCT GTGGACCAGG GnACTGTTr TACCACTAAC ATCTCCTTTT 1020 

GGCTCAGCAT TCACCGAGCT mOGGAAAT GCTGnGGAG AGCAAATAAA TAAACGCATT 1080 

35 CATTTCTCTA TGCAAAAAAA AAAAAAAAAA AAAA 1114 

SEQ ID NO: 8 
SEQUENCE LENGTH: 684 

40 

SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
4s MOLECULE TYPE: cDNA to uRNA 

HYPOTHETICAL: No 
ANTISENSE: No 
ORIGINAL SOURCE: 

so 

ORGANISM: Mus Musculus 
CELL LINE: IMC-HAl cell 
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ORIGINAL SOURCE: 

LIBEARY: IMC-HAl cell-derived cDNA library 

CLONE: 28-1 #3 
FEATURE: 
NAME/KEY: CDS 
LOCATION: 40.. 420 
IDENTIFICATION HEIBOD: E 



SEQUENCE DESCRIPTION: SEQ ID NO: 8 

TGnAHAAT GATCTGGAAG CACCAGAGGC CAGrTTGGG ATG CCA AAC CCC TAG CTA 57 

Met Pro Asn Pro Tyr Leu 
1 5 

ACC AGA AGA AGA GAA GAA AGG CCA CTG CCG AGG TCA CTT CCA 6TA Cn 105 
Thr Arg Arg Arg Glu Glu Arg Pro Leu Pro Arg Ser Leu Pro Val Leu 

10 15 20 

GCA GAT GCT ATC AAA ATC GGA GCC TGC CCT GCT AAA AAG CCT GCC GAG 153 
Gly Asp Ala lie Lys He Gly Ala Cys Pro Ala Lys Lys Pro Ala Gin 

25 30 35 

TGC CTT AAG CTT GAG AAG CCA CAA TGC CGT ACT GAC TGG GAG TGC CCG 201 
Cys Leu Lys Leu Glu Lys Pro Gin Cys Arg Thr Asp Trp Glu Cys Pro 

40 45 50 

GGA AAG CAG AGG TGC TGC CAA GAT GCT TGC GGT TCC AAG TGC GTG AAT 249 
Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly Ser Lys Cys Val Asn 
55 60 65 70 

CCT GTT CCC ATT CGC AAA CCA GTG TGG AGG AAG CCT GGG AGG TGC GTG 297 
Pro Val Pro lie Arg Lys Pro Val Trp Arg Lys Pro Gly Arg Cys Val 

75 80 85 

AAA ACT CAG GCA AGA TGT ATC ATG CTT AAG GCT CGC AAT GTG TGG GAG 345 
Lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro Pro Asn Val Cys Gin 

90 95 100 

AGG GAC GGG CAG TGT GAC GGC AAA TAG AAG TGC TGT GAG GGT ATA TGT 393 
Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys Cys Glu Gly He Cys 

105 110 115 

GGG AAA GTC TGC CTG CCC CCG ATG TGAGCCTGAT CCGTGACATT G6CGCCGGCT 447 
Gly Lys Val Cys Leu Pro Pro Met 
120 125 
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IS 



25 



CTG6ACTCGT GCTCGGTGT6 CTCTGGAAAC TACTTCCCTG CTCCCAGGCG TCCCT6CTCC 507 

GGGTTCCATG GCTCCCGGCT CCCTGTATCC CAGGCTTGGA TCCTGTGGAC CAGGGHACT 567 

* GTTnACCAC TAACATCTCC TTTTCGCTCA 6CATTCACCG AGCmAGGG AAATGCTGH 627 

GtAGAGCAiA TAAATAAACG CATTCATnC TCTATGCAAA AAAAAAAAAA AAAAAAA 684 

10 SEQ ID NO: 9 

SEQUENCE LENGTH: 126 
SEQUENCE TYPE: aaino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE: 

ORGANISM: Mus Husculus 
20 CELL LINE: IMC-IlAl cell 

SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Met Pro Asn Pro Tyr Leu 
1 5 
Thr Arg Arg Arg Glu Glu Arg Pro Leu Pro Arg Ser Leu Pro Val Leu 
10 15 20 

) 30 Gly Asp Ala He Lys He Gly Ala Cys Pro Ala Lys Lys Pro Ala Gin 

25 30 35 

Cys Leu Lys Leu Glu Lys Pro Gin Cys Arg Thr Asp Trp Glu Cys Pro 

40 45 50 

Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly Ser Lys Cys Val Asn 
55 60 65 . 70 

Pro Val Pro He Arg Lys Pro Val Trp Arg Lys Pro Gly Arg Cys Val 

75 80 85 

lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro Pro Asn Val Cys Gin 

90 95 100 

Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys Cys Glu Gly He Cys 

105 110 115 

Gly Lys Val Cys Leu Pro Pro Met 
120 . 125 



35 



40 



45 



50 



55 



SEQ ID NO: 10 
SEQUENCE LENGTH: 25 
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SEQUENCE TYPE: nucleic acid 

STRANDEONESS: single 

TOPOLOGY: linear 

KOLECULE TYPE: synthetic DNA 

SEQUENCE DESCaiPTION: SEQ ID NO: 10 

ATTAACCCTC ACTAAATGCT GGGBA 

SEQ ID NO: 11 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 11 

AnAACCCTC ACTAAATGCT GGAGG 

SEQ ID NO: 12 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 12 

AHAACCCTC ACTAAATGCT GGT6G 

SEQ ID NO: 13 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 13 

ATTAACCCTC ACTAAATGCT G6TAG 

SEQ ID NO: 14 
SEQUENCE LENGTH: 26 
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SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 14 

ATTAACCCTC ACTAAAGATC T6ACT6 

SEQ ID NO: 15 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 15 

AnAACCCTC ACTAAATGCT 6GGTG 

SEQ ID NO: 16 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 16 

ATTAACCCTC ACTAAATGCT 6TATG 

SEQ ID NO: 17 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 17 

ATTAACCCTC ACTAAATGGA GCT6G 

SEQ ID NO: 18 
SEQUENCE LENGTH: 25 
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SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 18 

AnAACCCTC ACTAAATGTG GCAGG 

SEQ 10 NO: 19 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DMA 

SEQUENCE DESCRIPTION: SEQ ID NO: 19 

AHAACCCTC ACTAAAGCAC CGTCC 

SEQ ID NO: 20 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 20 

CAmiGCTG AGTGATATCT TTTTTTTTAA 

SEQ ID NO: 21 

SEQUENCE LENGUI: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 21 

CAnATGCTG AGTGATATCT TTTTTTTTAC 

SEQ ID NO: 22 
SEQUENCE LENGTH: 30 
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STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 22 

CATTATGCTG AGTGATATCT TTTTTTTTAG 

SEQ ID NO: 23 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 23 

CATTATGCTG AGTGATATCT TTTTTTTTCA 

SEQ ID NO: 24 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 24 

CATTATGCTG AGTGATATCT TTTTTTTTCC 

SEQ ID NO: 25 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 25 

CATTATGCTG AGTGATATCT TTTTTTTTCG 

SEQ ID NO: 26 
SEQUENCE LENGTH: 30 
STRANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CAHATGCTG AGTGATATCT TTTTTTTTGA 

SEQ ID NO: 27 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 27 

CAnATGCTG AGTGATATCT TTTTTTrTGC 

SEQ ID NO: 28 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 28 

CAHATGCTG AGTGATATCT TTTTTTTTGG 

SEQ ID NO: 29 

SEQUENCE LENGTH: 20 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 29 

CAGAATCTGC TCATGCAGTC 

SEQ ID NO: 30 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STEANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CACTCCTTAC TTTCCACCCC 

SEQ ID NO: 31 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 31 

AACAGCATTT CCCTAAAGCT CGG 

SEQ ID NO: 32 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 32 

TGGAAACTAC TTCCCTGCTC CCA 

SEQ ID NO: 33 

SEQUENCE LENGTH: 20 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 33 

AAHAACCCT CACTAAAGGG 

SEQ ID NO: 34 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY; linear 
HOLECUIE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GTAATACGAC TCACTATAGG GC 

SEQ ID NO: 35 

SEQUENCE LENGTH: 26 

SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

7F2 

SEQUENCE DESCRIPTION: SEQ ID NO: 35 
CTCGGGACTG GTCATCAGAG CTCCCC 

SEQ ID NO: 36 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 37 

GACTCTATCT GCTTTGCTCT GCAC 

SEQ ID NO: 37 

SEQUENCE LENGTH: 800 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: cDNA to aRNA 

HYPOTHETICAL: 

ANTISENSE: No 

ORIGINAL SOURCE: 

ORGANISM: Mus Muscukus 
CELL LINE: IMC-HAI cell 

FEATURE: 



34 



EP 0 974 652 A1 

NAMEAEY: CDS 
LOCATION: 83.. 583 
IDENTIFICATION METHOD : E 

SEQUENCE DESCRIPTION: SEQ ID NO: 37 

ACAGACCACT GCCCCCACCT GCCCTGCGCC ATCTACCCAA GAAGGCTC6G CACGGGCACC 60 
AACCACTGCC TCCAACTGCC CC AT6 CTG CCT GAG AAG GCA CTG CAC GGC CAC 112 



Met Leu Pro Glu Lys Ala Leu His Gly His 
1 5 10 



15 


CCC 


CAA 


CTG 


CCC 


CGC 


ACT 


GTC 


CCT 


ACC 


CGG 


GCA GCC 


ATG 


CGA 6C6 


GCT 


160 




Pro 


Gin 


Leu 


Pro 


Arg 
15 


Thr 


Val 


Pro 


Thr 


Arg 
20 


Ala Ala 


Met 


Arg Ala 

25 


Ala 




20 


GGA 


ACT 


CTG 


CTG 


6CC 


TTC 


TGC 


TGC 


CTG 


GTC 


TTG AGC 


ACC 


ACT GGG 


GGC 


208 




Gly 


Thr 


Leu 


Leu 
30 


Ala 


Phe 


Cys 


Cys 


Leu 
35 


Val 


Leu Ser 


Thr 


Thr Gly 
40 


Gly 






CCT 


TCC 


CCA 


GAT 


ACT 


TGT 


TCC 


CAG 


GAC 


CTT 


AAC TCA 


CGT 


GTG AAG 


CCA 


256 


2S 


Pro 


Ser 


Pro 
45 


Asp 


Thr 


Cys 


Ser 


Gin 
50 


Asp 


Leu 


Asn Ser 


Arg 

55 


Val Lys 


Pro 






liuA 


TTT 




AAA 

AAA 


APA 

ALA 


ATA 


A AP 

AAu 


APP 

ALL 


A AT 

AAT 


Pip 


CLA uuA 


GTC 


CTC CAA 


GCA 


304 


30 




roe 


rro 


Lys 


inr 


Tin 

lie 


Lys 


inr 


Asn 


Asp 


rro uiy 


Val 


Leu Gin 


A 1.. 

Ala 








bu 










bb 








70 












Gcg- 


-AGA 


TAC 


AGT 


GTT 


GAA 


AAG 


TTC 


AAC 


AAC 


TGC ACG 


AAC 


GAC ATG 


TTC 


352 




Ala 


Arg 


Tyr 


Ser 


Val 


Glu 


Lys 


Phe 


Asn 


Asn 


Cys Thr 


Asn 


Asp Met 


Phe 




35 


75 










80 










85 






90 






TTG 


TTC 


AAG 


GAG 


TCC 


CGC 


ATC 


ACA 


AG6 


GCC 


CTA GTT 


CAG 


ATA GTG 


AAA 


400 




Leu 


Phe 


Lys 


Glu 


Ser 


Arg 


He 


Thr 


Arg 


Ala 


Leu Val 


Gin 


He Val 


Lys 




40 










95 










100 






105 






GGC 


CTG 


AAA 


TAT 


ATG 


CTC 


GAG 


6TG 


GAA 


An 


GGC AGA 


ACT 


ACC TGC 


AAG 


448 




Gly 


Leu 


Lys 


Thr 
110 


Met 


Leu 


Glu 


Val 


Glu 
115 


He 


Gly Arg 


Thr 


Thr Cys 
120 


Lys 




45 


AAA 


AAC 


CA6 


CAC 


CTG 


CGT 


CTG 


GAT 


GAC 


TGT 


GAC nc 


CAA 


ACC AAC 


CAC 


496 




Lys 


ASD 


Gin 
125 


His 


Leu 


Arg 


Leu 


Asp 
130 


Asp 


Cys 


Asp Phe 


Gin 
135 


Thr Asn 


His 




SO 


AGO 


m 


AAG 


CAG 


ACT 


CTG 


AGC 


TGC 


TAC 


TCT 


GAA GTC 


TGG 


GTC GTG 


CCC 


544 


Thr 


Leu 


Lys 


Gin 


Thr 


Leu 


Ser 


Cys 


Tyr 


Ser 


Glu Val 


Trp Val Val 


Pro 








140 










145 








150 











55 
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15 



20 



25 



30 



40 



SO 



TG6 CTC CAG CAC TTC GAG GTG CCT GTT CTC CGT TGT CAC TGACCCCCGC 593 
Trp Leu Gin His Phe Glu Val Pro Val Leu Arg Cys His 
155 160 165 

CTCTTCAGCA AGACCACAGC CATGACAAAC ACCAG6ATGC ATGCTCCTT6 TCCCCTCCCA 653 

CCCGCCTCAT GACCCAGCCT CACAGACCCT CTCAGGCCTC TGACGAGT6A GCGGGTGAAG 713 

TGCCACTGGG TCACCGCAGG GCAGCTGGAA TGGCAGCATG GTAGCGCCTC CTAACA6ATT 773 

AAATAGATCA CATTTGCTTC TAAAATT 800 

SEQ ID NO: 38 
SEQUENCE LENGTH: 167 
SEQUENCE TYPE: aiino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE: 

ORGANISM: Mus Muscukus 

CELL LINE: INC-HAl cell 



SEQUENCE DESCRIPTION: SEQ ID NO: 38 

Met Leu Pro Glu Lys Ala Leu His Gly His 
1 5 10 

Pro Gin Leu Pro Arg Thr Val Pro Thr Arg Ala Ala Met Arg Ala Ala 

15 20 25 

Gly Thr Leu Leu Ala Phe Cys Cys Leu Val Leu Ser Thr Thr Gly Gly 
35 30 35 40 

Pro Ser Pro Asp Thr Cys Ser Gin Asp Leu Asn Ser Arg Val Lys Pro 

45 50 55 

Gly Phe Pro Lys Thr He Lys Thr Asn Asp Pro Gly Val Leu Gin Ala 

60 65 70 

Ala Arg Tyr Ser Val Glu Lys Phe Asn Asn Cys Thr Asn Asp Met Phe 
75 80 85 90 

« Leu Phe Lys Glu Ser Arg He Thr Arg Ala Leu Val Gin He Val Lys 

95 100 105 

Gly Leu Lys Thr Met Leu Glu Val Glu He Gly Arg Thr Thr Cys Lys 

110 115 120 

Lys Asn Gin His Leu Arg Leu Asp Asp Cys Asp Phe Gin Thr Asn His 
125 130 135 



55 
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Thr Leu Lys Gin Thr Leu Ser Cys Tyr Ser Glu Val Trp Val Val Pro 

140 145 150 

Trp Leu Gin His Phe Glu Val Pro Val Leu Arg Cys His 
155 160 165 

SEQ ID NO: 39 

SEQUENCE LENGTH: 21 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

KOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 39 

ACAGACACTG CCCCCACCTG C 

SEQ ID NO: 40 

SEQUENCE LENGTH: 22 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 40 

AATTTTAGAA GCAAATGTGA TC 

SEQ ID NO: 41 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 41 

ACATGTCGn CGTGCAGnC TTG 

SEQ ID NO: 42 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 42 
AAnOGCAGA ACTACCT6CA AG 

SEQ ID NO: 43 

SEQUENCE LENGTH: 22 

SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 43 

GGTCnCCTG AAGA6GCGG6 GG 

SEQ ID NO: 44 

SEQUENCE LENGTH: 22 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 44 

CACTGTCCCT ACCCGGGCAG CC 

SEQ ID NO: 45 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 45 

ATCTACCCAA GAAGGCTCG6 CAC 

SEQ ID NO: 46 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 46 

TCTGTTAGGA GGCGCTACCA TGC 9.1 



SEQ ID NO: 47 
SEQUENCE LENGIfi: 35 
SEQUENCE TYPE: nucleic acid 
STEANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
20 SEQUENCE DESCRIPTION: SEQ ID NO: 47 

AATTCGGTAC CA6CTGAAGC TACCCCACCA TGCCC 35 

SEQ ID NO: 48 
SEQUENCE LENGTH: 34 
SEQUENCE TYPE: nucleic acid 
STEANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 48 
35 AGTCGCTAGC AGAGGAGAAC A6GCACCTCA AAAC 34 

SEQ ID NO: 49 
SEQUENCE LENGTH: 24 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 49 
TGTTATTAAT GATCTGGAAG CACC 24 

50 



55 Claims 



1 . A protein comprising any one of the amino acid sequences set forth in SEQ ID NOs: 4. 6, 9, or 38, or a protein com- 
prising any one of said amino acid sequences having substitution, deletion, or addition of one or more amino acids 
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and having cancer metastasis potency. 

2. A protein encoded by DNA wtiich hytjridizes with DNA conprising any one of the nucleotide sequences set forth in 
SEQ ID NOs: 3, 5, 7. 8, or 37, said protein having cancer metastasis potency. 

3. A DNA encoding the protein according to claim 1 . 

4. Tlie DNA according to daim 3, comprising any one of the nudeoiide sequences set forth in SEQ ID NOs: 3, 5. 7, 
8, or 37. 

5. A DNA hybridizing with a DNA comprising any one of the nucleotide sequences set forth In SEQ ID NOs: 3. 5, 7, 
8, or 37, encoding a protein having cancer metastasis potency. 

6. A vector can-ying the DNA according to any one of claims 3 to 5. 

7. A transformant hartx)ring the vector according to claim 8. 

8. A method for preparing the protein according to claim 1 or 2, comprising culturing the transformant set forth in daim 
7. 

9. A DNA specifically hybridizing with the DNA according to any one of claims 3 to 5, comprising at least 1 5 nucleotide 
residues. 

10. An antisense DNA aganst the DNA according to any one of claims 3 to 5 or a portion thereof. 

11. An antibody tending to the protein according to claim 1 or 2. 

12. A method for screening a compound having cancer metastasis inhibitory ability, comprising steps of: 

(a) contacting a test sample with the protein according to daim 1 or 2, and 

(b) selecting conpounds having ttie activity to bind to the protein according to daim 1 or 2. 

13. A mettiod for screening compounds having cancer metastasis inhibitory ability, comprising steps of: 

(a) contacting test samples with cells expressing the protein according to claim 1 or 2, 

(b) detecting the expression level of the protein according to daim 1 or 2 in cells contacted with test sanples 
using the arrtllxjdy set forth in daim 1 1 , and 

(c) selecting a compound which reduces the expression level of the protein according to claim 1 or 2 as com- 
pared witii that in cells not contacted with test samples. 
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Figure 1 
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Figure 2 
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Figure 3 
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